useful sharded checkpoints for users to run inference / fine-tuning on a Google colab without having to deal with CPU OOM issues.
Younes B
ybelkada
AI & ML interests
Large Language Models, Quantization, Vision, Multimodality, Diffusion models
Recent Activity
liked
a model 15 days ago
tiiuae/Falcon-H1R-7B-FP8 liked
a model 19 days ago
Aratako/MioTTS-GGUF liked
a model 20 days ago
Aratako/MioTTS-0.1B