Inference Optimization
community
Recent Activity
krishnateja95 updated a collection about 19 hours ago: HIGGS-stiched
Team members: 17
inference-optimization's models (334), sorted by recently updated
inference-optimization/Qwen3-8B_5.5_bits_mode_heuristic • 6B • Updated Mar 12 • 1
inference-optimization/Qwen3-8B_5.5_bits_mode_noise • 6B • Updated Mar 12 • 2
inference-optimization/Qwen3-8B_5.5_bits_mode_hybrid • 6B • Updated Mar 12 • 2
inference-optimization/Qwen3-8B_5_bits_mode_heuristic • 6B • Updated Mar 12 • 2
inference-optimization/Qwen3-8B_5_bits_mode_noise • 6B • Updated Mar 12 • 1
inference-optimization/Qwen3-8B_5_bits_mode_hybrid • 6B • Updated Mar 12 • 1
inference-optimization/Llama-3.1-8B-Instruct_7_bits_mode_heuristic • 7B • Updated Mar 12 • 2
inference-optimization/Llama-3.1-8B-Instruct_7_bits_mode_noise • 7B • Updated Mar 12 • 1
inference-optimization/Llama-3.1-8B-Instruct_7_bits_mode_hybrid • 7B • Updated Mar 12 • 1
inference-optimization/Llama-3.1-8B-Instruct_6.5_bits_mode_heuristic • 7B • Updated Mar 12 • 2
inference-optimization/Llama-3.1-8B-Instruct_6.5_bits_mode_noise • 7B • Updated Mar 12 • 1
inference-optimization/Llama-3.1-8B-Instruct_6.5_bits_mode_hybrid • 7B • Updated Mar 12 • 1
inference-optimization/Llama-3.1-8B-Instruct_6_bits_mode_heuristic • 6B • Updated Mar 12 • 1
inference-optimization/Llama-3.1-8B-Instruct_6_bits_mode_noise • 6B • Updated Mar 12 • 2
inference-optimization/Llama-3.1-8B-Instruct_6_bits_mode_hybrid • 6B • Updated Mar 12 • 6
inference-optimization/Llama-3.1-8B-Instruct_5.5_bits_mode_heuristic • 6B • Updated Mar 12 • 1
inference-optimization/Llama-3.1-8B-Instruct_5.5_bits_mode_noise • 6B • Updated Mar 12 • 2
inference-optimization/Llama-3.1-8B-Instruct_5.5_bits_mode_hybrid • 6B • Updated Mar 12 • 2
inference-optimization/Llama-3.1-8B-Instruct_5_bits_mode_heuristic • 6B • Updated Mar 12 • 4
inference-optimization/Llama-3.1-8B-Instruct_5_bits_mode_noise • 6B • Updated Mar 12 • 1
inference-optimization/Llama-3.1-8B-Instruct_5_bits_mode_hybrid • 6B • Updated Mar 12 • 4
inference-optimization/sarvam-105b-FP8-Dynamic • Text Generation • 106B • Updated Mar 9 • 9
inference-optimization/sarvam-30b-FP8-Dynamic • Text Generation • 32B • Updated Mar 9 • 54 • 1
inference-optimization/sarvam-30b-NVFP4 • Text Generation • 19B • Updated Mar 9 • 15 • 1
inference-optimization/sarvam-105b-NVFP4 • 61B • Updated Mar 9 • 3 • 1
inference-optimization/Qwen3.5-35B-A3B-FP8-Dynamic • 35B • Updated Mar 6 • 3
inference-optimization/gpt-oss-20b-FP8-Dynamic • 21B • Updated Mar 5 • 12 • 1
inference-optimization/Qwen3-30B-A3B-Instruct-2507-NVFP4 • 17B • Updated Mar 4 • 49
inference-optimization/Qwen3-30B-A3B-Instruct-2507-FP8-Dynamic • 31B • Updated Mar 4 • 50
inference-optimization/Qwen3-30B-A3B-Instruct-2507-FP8-Block • 31B • Updated Mar 4 • 2