Measuring Maximum Activations in Open Large Language Models Paper • 2605.15572 • Published 8 days ago • 18
EndPrompt: Efficient Long-Context Extension via Terminal Anchoring Paper • 2605.14589 • Published 9 days ago • 14
The Smol Training Playbook: The secrets to building world-class LLMs • Featured • Running on CPU Upgrade • 3.18k