amd/DeepSeek-R1-Distill-Llama-8B-awq-asym-uint4-g128-lmhead-onnx-cpu Text Generation • Updated Jan 30, 2025
Quark Quantized ONNX LLMs for Ryzen AI 1.3 EA Collection ONNX Runtime generate() API based models quantized by Quark and optimized for Ryzen AI Strix Point NPU • 8 items • Updated Feb 19 • 8
Running on CPU Upgrade 14k Open LLM Leaderboard 🏆 14k Track, rank and evaluate open LLMs and chatbots