daVinci-Dev: Agent-native Mid-training for Software Engineering Paper โข 2601.18418 โข Published Jan 26 โข 124
RAPO++: Cross-Stage Prompt Optimization for Text-to-Video Generation via Data Alignment and Test-Time Scaling Paper โข 2510.20206 โข Published Oct 23, 2025 โข 12
Running Featured 131 Open VLM Video Leaderboard ๐ 131 VLMEvalKit Eval Results in video understanding benchmark
Running on CPU Upgrade 13.9k Open LLM Leaderboard ๐ 13.9k Track, rank and evaluate open LLMs and chatbots
view article Article Smol2Operator: Post-Training GUI Agents for Computer Use +3 Sep 23, 2025 โข 137