Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution Paper • 2602.12684 • Published 4 days ago • 3 • 2
Self-EvolveRec: Self-Evolving Recommender Systems with LLM-based Directional Feedback Paper • 2602.12612 • Published 4 days ago • 2 • 2
GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristics Paper • 2602.12617 • Published 4 days ago • 18 • 2
DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels Paper • 2602.11715 • Published 5 days ago • 5 • 3
Light4D: Training-Free Extreme Viewpoint 4D Video Relighting Paper • 2602.11769 • Published 5 days ago • 2 • 2
BPDQ: Bit-Plane Decomposition Quantization on a Variable Grid for Large Language Models Paper • 2602.04163 • Published 13 days ago • 6 • 2
FLAC: Maximum Entropy RL via Kinetic Energy Regularized Bridge Matching Paper • 2602.12829 • Published 4 days ago • 3 • 2
On Robustness and Chain-of-Thought Consistency of RL-Finetuned VLMs Paper • 2602.12506 • Published 4 days ago • 3 • 1
Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception Paper • 2602.11858 • Published 5 days ago • 50 • 2
scPilot: Large Language Model Reasoning Toward Automated Single-Cell Analysis and Discovery Paper • 2602.11609 • Published 5 days ago • 1 • 2
ABot-M0: VLA Foundation Model for Robotic Manipulation with Action Manifold Learning Paper • 2602.11236 • Published 5 days ago • 10 • 3
Learning Image-based Tree Crown Segmentation from Enhanced Lidar-based Pseudo-labels Paper • 2602.13022 • Published 4 days ago • 1 • 2
CoPE-VideoLM: Codec Primitives For Efficient Video Language Models Paper • 2602.13191 • Published 3 days ago • 21 • 2
SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents Paper • 2602.12984 • Published 4 days ago • 4 • 2
Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs Paper • 2602.10388 • Published 6 days ago • 202 • 3
GeneralVLA: Generalizable Vision-Language-Action Models with Knowledge-Guided Trajectory Planning Paper • 2602.04315 • Published 13 days ago • 1 • 2
SemanticMoments: Training-Free Motion Similarity via Third Moment Features Paper • 2602.09146 • Published 7 days ago • 17 • 2
MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs Paper • 2602.12705 • Published 4 days ago • 56 • 5
Code2Worlds: Empowering Coding LLMs for 4D World Generation Paper • 2602.11757 • Published 5 days ago • 3 • 2
Quantized Evolution Strategies: High-precision Fine-tuning of Quantized LLMs at Low-precision Cost Paper • 2602.03120 • Published 14 days ago • 1 • 2