Transformers
GGUF
English
256k context
Qwen3
Mixture of Experts
MOE
MOE Dense
2 experts
4Bx12
All use cases
bfloat16
Merge
thinking
reasoning
GPT-5.1-High-Reasoning-Distill
Gemini-3-Pro-Preview-High-Reasoning-Distill
Claude-4.5-Opus-High-Reasoning-Distill
Claude-Sonnet-4-Reasoning-Distill
Kimi-K2-Thinking-Distill
Gemini-2.5-Flash-Distill
Gemini-2.5-Flash-Lite-Preview-Distill
gpt-oss-120b-Distill
GLM-Flash-4.6-Distill
Open-R1-Distill
Command-A-Reasoning-Distill
conversational