Enhancing Multi-Image Understanding through Delimiter Token Scaling Paper • 2602.01984 • Published 6 days ago • 5
DISCO: Diversifying Sample Condensation for Efficient Model Evaluation Paper • 2510.07959 • Published Oct 9, 2025 • 15
Does Data Scaling Lead to Visual Compositional Generalization? Paper • 2507.07102 • Published Jul 9, 2025 • 2
OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion Paper • 2507.06165 • Published Jul 8, 2025 • 60
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning Paper • 2507.05920 • Published Jul 8, 2025 • 12
Is Diversity All You Need for Scalable Robotic Manipulation? Paper • 2507.06219 • Published Jul 8, 2025 • 21
AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance Paper • 2506.03828 • Published Jun 4, 2025 • 15
ImgEdit: A Unified Image Editing Dataset and Benchmark Paper • 2505.20275 • Published May 26, 2025 • 18
Exploring the Latent Capacity of LLMs for One-Step Text Generation Paper • 2505.21189 • Published May 27, 2025 • 61
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data Paper • 2505.18445 • Published May 24, 2025 • 63
MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs Paper • 2505.21327 • Published May 27, 2025 • 83