The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models Paper • 2601.15165 • Published 3 days ago • 57
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience Paper • 2601.15876 • Published 2 days ago • 68
BayesianVLA: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries Paper • 2601.15197 • Published 3 days ago • 51
GutenOCR: A Grounded Vision-Language Front-End for Documents Paper • 2601.14490 • Published 4 days ago • 28
Article LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family 5 days ago • 57
Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization Paper • 2601.12993 • Published 5 days ago • 71
Qwen Image Edit (exps) Collection LoRA adapters developed for Qwen’s Qwen-Image-Edit-2511 image-to-image model • 12 items • Updated about 23 hours ago • 2
YOLO26 Models Collection YOLO26 models: detection, segmentation, classification, pose, and OBB variants with demos and ONNX variants. • 42 items • Updated 5 days ago • 27
A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5 Paper • 2601.10527 • Published 9 days ago • 23
FlowAct-R1: Towards Interactive Humanoid Video Generation Paper • 2601.10103 • Published 9 days ago • 33
RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes Paper • 2601.05249 • Published 16 days ago • 46
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 16 days ago • 204
UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement Paper • 2512.21185 • Published about 1 month ago • 30