From Pixels to Words -- Towards Native One-Vision Models at Scale Paper • 2605.28820 • Published 3 days ago • 65
ResearchMath-14K: Scaling Research-Level Mathematics via Agents Paper • 2605.28003 • Published 3 days ago • 43
ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation Paper • 2605.28293 • Published 3 days ago • 78
[Trimming] Qwen3 Embedding 0.6B Collection Collection of trimmed versions of Qwen's Qwen3-Embedding-0.6B model. The models are sorted alphabetically. • 166 items • Updated 1 day ago • 1
MTP Qwen 3.5/3.6 Stable Collection Collection of Qwen 3.5/3.6 MTP models, featuring GGUF • 4 items • Updated 1 day ago • 1
RFDetr Collection RF-DETR checkpoints converted to be used with 🤗 Transformers • 15 items • Updated 3 days ago • 13
PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion Paper • 2605.23902 • Published 8 days ago • 41
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding Paper • 2605.27365 • Published 4 days ago • 118
Geometry-Aware Representation Denoising for Robust Multi-view 3D Reconstruction Paper • 2605.26230 • Published 5 days ago • 38
SpatialBench: Is Your Spatial Foundation Model an All-Round Player? Paper • 2605.27367 • Published 4 days ago • 66
QIE/FRIE1.1 — Test LoRAs [May2026] Collection Collection of Qwen Image Editing LoRAs • 4 items • Updated 4 days ago • 1
DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning Paper • 2605.25604 • Published 5 days ago • 130
WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation Paper • 2605.25874 • Published 5 days ago • 97
SEGA: Spectral-Energy Guided Attention for Resolution Extrapolation in Diffusion Transformers Paper • 2605.22668 • Published 9 days ago • 40
PhysX-Omni: Unified Simulation-Ready Physical 3D Generation for Rigid, Deformable, and Articulated Objects Paper • 2605.21572 • Published 10 days ago • 51