swiss-ai/Apertus-8B-Instruct-2509 • Text Generation • 8B • Updated about 1 hour ago • 199k • 446
The Ultra-Scale Playbook • Running • 3.8k • The ultimate guide to training LLMs on large GPU clusters