The Smol Training Playbook 📚 The secrets to building world-class LLMs • 3.2k
The ultimate guide to RL environments: building and scaling them in the LLM era 📝 Building and scaling RL environments for LLM training • 181
Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning Paper • 2512.24265 • Published Dec 30, 2025 • 4
UltraData Collection Ultra Scale, Ultra Quality, Ultra Coverage • 11 items • Updated 12 days ago • 97
Does your data spark joy? Performance gains from domain upsampling at the end of training Paper • 2406.03476 • Published Jun 5, 2024 • 4