Grid2Matrix: Revealing Digital Agnosia in Vision-Language Models Paper • 2604.09687 • Published Apr 14 • 8
Combee: Scaling Prompt Learning for Self-Improving Language Model Agents Paper • 2604.04247 • Published Apr 5 • 31
Can AI Agents Answer Your Data Questions? A Benchmark for Data Agents Paper • 2603.20576 • Published Mar 21 • 3
SVG-EAR: Parameter-Free Linear Compensation for Sparse Video Generation via Error-aware Routing Paper • 2603.08982 • Published Mar 9 • 16
V_1: Unifying Generation and Self-Verification for Parallel Reasoners Paper • 2603.04304 • Published Mar 4 • 14
SLA2: Sparse-Linear Attention with Learnable Routing and QAT Paper • 2602.12675 • Published Feb 13 • 59
AgenticPay: A Multi-Agent LLM Negotiation System for Buyer-Seller Transactions Paper • 2602.06008 • Published Feb 5 • 5
MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning Paper • 2512.16909 • Published Dec 18, 2025 • 3
Can Large Vision Language Models Read Maps Like a Human? Paper • 2503.14607 • Published Mar 18, 2025 • 10
Robo2VLM: Visual Question Answering from Large-Scale In-the-Wild Robot Manipulation Datasets Paper • 2505.15517 • Published May 21, 2025 • 7
SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild Paper • 2401.10171 • Published Jan 18, 2024 • 14
NeRFiller: Completing Scenes via Generative 3D Inpainting Paper • 2312.04560 • Published Dec 7, 2023 • 13
Language Embedded Radiance Fields for Zero-Shot Task-Oriented Grasping Paper • 2309.07970 • Published Sep 14, 2023 • 8