LiveEdit: Towards Real-Time Diffusion-Based Streaming Video Editing Paper • 2606.26740 • Published 8 days ago • 78
ViQ: Text-Aligned Visual Quantized Representations at Any Resolution Paper • 2606.27313 • Published 8 days ago • 38
The Verification Horizon: No Silver Bullet for Coding Agent Rewards Paper • 2606.26300 • Published 9 days ago • 46
view article Article Kog Laneformer 2B: The Latency-First Model Behind Kog Inference Engine kogai • 8 days ago • 32
BrainG3N: A Dual-Purpose Tokenizer for Controllable 3D Brain MRI Generation Paper • 2606.19651 • Published 16 days ago • 10
PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models Paper • 2606.19534 • Published 16 days ago • 64
Moebius: 0.2B Lightweight Image Inpainting Framework with 10B-Level Performance Paper • 2606.19195 • Published 16 days ago • 139
Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings Paper • 2606.07502 • Published 28 days ago • 99
SWE-Explore: Benchmarking How Coding Agents Explore Repositories Paper • 2606.07297 • Published 28 days ago • 122
Draft-OPD: On-Policy Distillation for Speculative Draft Models Paper • 2605.29343 • Published May 28 • 36
view article Article OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models nvidia • Jul 18, 2025 • 51
MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping Paper • 2604.08364 • Published Apr 9 • 103
Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing Paper • 2603.12254 • Published Mar 12 • 23
Qwen3.5 Collection Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 18 days ago • 161
Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device Paper • 2602.20161 • Published Feb 23 • 23
SeaCache: Spectral-Evolution-Aware Cache for Accelerating Diffusion Models Paper • 2602.18993 • Published Feb 22 • 4