Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published 6 days ago • 304
Appear2Meaning: A Cross-Cultural Benchmark for Structured Cultural Metadata Inference from Images Paper • 2604.07338 • Published 6 days ago • 4
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 12 days ago • 464
Devy1/Qwen2.5-Coder-CONTROL-checkpoints_multi_language_2k-1.5B-Base-3 2B • Updated 12 days ago • 18 • 1
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published Feb 9 • 263