Claw-SWE-Bench: A Benchmark for Evaluating OpenClaw-style Agent Harnesses on Coding Tasks Paper • 2606.12344 • Published 20 days ago • 70
usermma/Huihui-MiniCPM5-1B-abliterated-mlx-4Bit Text Generation • 0.2B • Updated 27 days ago • 115 • 1
Joint Training of Multi-Token Prediction in Reinforcement Learning via Optimal Coefficient Calibration Paper • 2605.28184 • Published May 27 • 6
Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality? Paper • 2605.22109 • Published May 21 • 171
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published Apr 13 • 103
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling Paper • 2603.25746 • Published Mar 26 • 155
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published Mar 17 • 248