Simulating Financial Market via Large Language Model based Agents Paper • 2406.19966 • Published Jun 28, 2024
CoSineVerifier: Tool-Augmented Answer Verification for Computation-Oriented Scientific Questions Paper • 2512.01224 • Published Dec 1, 2025
Nanbeige4-3B Technical Report: Exploring the Frontier of Small Language Models Paper • 2512.06266 • Published Dec 6, 2025 • 6
From Failure to Mastery: Generating Hard Samples for Tool-use Agents Paper • 2601.01498 • Published Jan 4, 2026 • 2
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration Paper • 2602.05400 • Published 8 days ago • 298
SWE-World: Building Software Engineering Agents in Docker-Free Environments Paper • 2602.03419 • Published 10 days ago • 39
Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 8 days ago • 51
FunReason-MT Technical Report: Overcoming the Complexity Barrier in Multi-Turn Function Calling Paper • 2510.24645 • Published Oct 28, 2025 • 10
Unlocking On-Policy Distillation for Any Model Family Space • Running • 84 • Explore on-policy distillation with interactive visualizations