On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers Paper • 2603.28762 • Published 9 days ago • 25
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models Paper • 2603.25716 • Published 13 days ago • 152
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling Paper • 2603.25746 • Published 13 days ago • 154
MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data Paper • 2603.25319 • Published 13 days ago • 32
AVControl: Efficient Framework for Training Audio-Visual Controls Paper • 2603.24793 • Published 14 days ago • 26
Omni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models Paper • 2603.22212 • Published 16 days ago • 125
Not All Layers Are Created Equal: Adaptive LoRA Ranks for Personalized Image Generation Paper • 2603.21884 • Published 16 days ago • 5
WorldCache: Content-Aware Caching for Accelerated Video World Models Paper • 2603.22286 • Published 16 days ago • 4
Versatile Editing of Video Content, Actions, and Dynamics without Training Paper • 2603.17989 • Published 21 days ago • 17