Effective Distillation to Hybrid xLSTM Architectures Paper • 2603.15590 • Published Mar 16 • 33 • 5
nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1 Text Generation • 5B • Updated Oct 15, 2025 • 25.2k • 114