MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing Paper β’ 2509.22186 β’ Published Sep 26, 2025 β’ 159
π΅ The MusicBox Collection A collection full of musical tasks demos, for musicians & music enthusiasts β’ 39 items β’ Updated 5 days ago β’ 33
Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations Paper β’ 2410.10792 β’ Published Oct 14, 2024 β’ 31
How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data Paper β’ 2409.03810 β’ Published Sep 5, 2024 β’ 35
Configurable Foundation Models: Building LLMs from a Modular Perspective Paper β’ 2409.02877 β’ Published Sep 4, 2024 β’ 32
MobileQuant: Mobile-friendly Quantization for On-device Language Models Paper β’ 2408.13933 β’ Published Aug 25, 2024 β’ 16