Mode Seeking meets Mean Seeking for Fast Long Video Generation Paper β’ 2602.24289 β’ Published 7 days ago β’ 36
Visual-Aware CoT: Achieving High-Fidelity Visual Consistency in Unified Models Paper β’ 2512.19686 β’ Published Dec 22, 2025
VisPhyWorld: Probing Physical Reasoning via Code-Driven Video Reconstruction Paper β’ 2602.13294 β’ Published 26 days ago β’ 13
Context Forcing: Consistent Autoregressive Video Generation with Long Context Paper β’ 2602.06028 β’ Published 29 days ago β’ 36
Context Forcing: Consistent Autoregressive Video Generation with Long Context Paper β’ 2602.06028 β’ Published 29 days ago β’ 36 β’ 7
Context Forcing: Consistent Autoregressive Video Generation with Long Context Paper β’ 2602.06028 β’ Published 29 days ago β’ 36 β’ 7
Context Forcing: Consistent Autoregressive Video Generation with Long Context Paper β’ 2602.06028 β’ Published 29 days ago β’ 36 β’ 7
Context Forcing: Consistent Autoregressive Video Generation with Long Context Paper β’ 2602.06028 β’ Published 29 days ago β’ 36 β’ 7
Context Forcing: Consistent Autoregressive Video Generation with Long Context Paper β’ 2602.06028 β’ Published 29 days ago β’ 36
ShowUI-Ο: Flow-based Generative Models as GUI Dexterous Hands Paper β’ 2512.24965 β’ Published Dec 31, 2025 β’ 42
GARDO: Reinforcing Diffusion Models without Reward Hacking Paper β’ 2512.24138 β’ Published Dec 30, 2025 β’ 29
MoCha Collection The pioneering work in Dialogue-driven Movie Shot Generation β’ 4 items β’ Updated Dec 27, 2025 β’ 2