Data Darwinism Part I: Unlocking the Value of Scientific Data for Pre-training Paper • 2602.07824 • Published 12 days ago • 15
Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts Paper • 2602.13367 • Published 7 days ago • 20
view article Article Alyah ⭐️: Toward Robust Evaluation of Emirati Dialect Capabilities in Arabic LLMs 24 days ago • 21
Running 37 Falcon-H1-Tiny: A series of extremely small, yet powerful language models redefining capabilities at small scale 📝 37 Generate text using extremely small yet powerful language models
tiiuae/Falcon-H1-Tiny-90M-Instruct-Curriculum-pre-DPO Text Generation • 91.1M • Updated Jan 15 • 17 • 1