cyankiwi/Qwen3-Next-80B-A3B-Instruct-AWQ-4bit Text Generation • 15B • Updated 8 days ago • 88.7k • 61
Quartet: Native FP4 Training Can Be Optimal for Large Language Models Paper • 2505.14669 • Published May 20, 2025 • 78
Cell2Sentence Models Collection Cell2Sentence models trained for single-cell tasks • 5 items • Updated Apr 16, 2025 • 16
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens Paper • 2504.07096 • Published Apr 9, 2025 • 77