Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
NVFP4
/
Qwen3-235B-A22B-Instruct-2507-FP4
like
3
Follow
NVFP4
98
Text Generation
Safetensors
Model Optimizer
qwen3_moe
nvidia
ModelOpt
Qwen3
quantized
FP4
conversational
8-bit precision
arxiv:
2505.09388
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
2
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (0)
Sort: Recently created
Cannot be loaded in vllm
1
#3 opened 4 months ago by
mratsim
quantization method?
1
#2 opened 6 months ago by
chriswritescode