Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Inference Providers
·
Metrics for top trending models
Browse all models
Learn more
Reset
Model
Provider
Input $/1M
Output $/1M
Context
Latency(s)
Throughput(t/s)
Tools
Structured
Qwen/Qwen3-32B
Qwen3-32B
groq
$0.29
$0.59
131,072
0.29
242
Yes
No
Qwen/Qwen3-32B
Qwen3-32B
novita
$0.10
$0.45
40,960
0.74
46
No
No
Qwen/Qwen3-32B
Qwen3-32B
cerebras
fastest
$0.40
$0.80
-
0.34
1,083
Yes
No
Qwen/Qwen3-32B
Qwen3-32B
sambanova
$0.40
$0.80
32,768
2.81
188
Yes
Yes
Qwen/Qwen3-32B
Qwen3-32B
nscale
cheapest
$0.08
$0.25
40,960
0.99
28
Yes
No
Qwen/Qwen3-32B
Qwen3-32B
ovhcloud
$0.09
$0.25
32,768
0.51
41
Yes
Yes