OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning Paper • 2606.26790 • Published 3 days ago • 40
Beyond NL2Code: A Structured Survey of Multimodal Code Intelligence Paper • 2606.15932 • Published 12 days ago • 36
The Hitchhiker's Guide to Agentic AI: From Foundations to Systems Paper • 2606.24937 • Published 6 days ago • 14
FLUX3D: High-Fidelity 3D Gaussian Generation with Diffusion-Aligned Sparse Representation Paper • 2606.24874 • Published 5 days ago • 1
Qwen-AgentWorld: Language World Models for General Agents Paper • 2606.24597 • Published 5 days ago • 133
Emotions Where Art Thou: Understanding and Characterizing the Emotional Latent Space of Large Language Models Paper • 2510.22042 • Published Jan 30 • 1
Emotion Concepts and their Function in a Large Language Model Paper • 2604.07729 • Published Apr 9 • 1
Making Qwen3 Think in Korean with Reinforcement Learning Paper • 2508.10355 • Published Aug 14, 2025 • 2
Everyday Physics in Korean Contexts: A Culturally Grounded Physical Reasoning Benchmark Paper • 2509.17807 • Published Sep 22, 2025 • 2
VARCO-VISION: Expanding Frontiers in Korean Vision-Language Models Paper • 2411.19103 • Published Nov 28, 2024 • 22
Can Large Models Teach Student Models to Solve Mathematical Problems Like Human Beings? A Reasoning Distillation Method via Multi-LoRA Interaction Paper • 2508.13037 • Published Aug 18, 2025 • 2
Machine Psychology: Integrating Operant Conditioning with the Non-Axiomatic Reasoning System for Advancing Artificial General Intelligence Research Paper • 2405.19498 • Published May 29, 2024 • 1
Dense Reward for Multi-View 3D Reasoning with Global Maps and Local Views Paper • 2606.23557 • Published 6 days ago • 5
PoLAR: Factorizing Extent and Mode in Latent Actions for Robot Policy Learning Paper • 2606.21139 • Published 9 days ago • 9
Selective Synergistic Learning for Video Object-Centric Learning Paper • 2606.15527 • Published 14 days ago • 4
EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts Paper • 2606.18967 • Published 11 days ago • 24
LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling Paper • 2606.18023 • Published 12 days ago • 207