Collection of Quantized Models for MoE
Krishna Teja Chitty-Venkata
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
updated a model 1 day ago
RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-Dynamic updated a model 1 day ago
RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16-FP8-BLOCK updated a model 1 day ago
RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16-W4A16-G128