veg
ciprianv
AI & ML interests
None yet
Recent Activity
liked a model 16 days ago
rdtand/Qwen3.6-27B-PrismaSCOUT-Blackwell-NVFP4-BF16-vllm
liked a model 17 days ago
XiaomiMiMo/MiMo-V2.5
liked a model 17 days ago
festr2/MiMo-V2.5-Pro-NVFP4-MXFP8-attn-TP8
Organizations
None yet
working vllm or sglang command
#1 opened 21 days ago by ciprianv
Looping in OpenCode
👀 1
5
#4 opened about 1 month ago by Jon-Nielsen
GPTQ vs Q4 GGUF
👀 3
1
#2 opened 3 months ago by ciprianv
Can't get it to work on 8x RTX3090
14
#1 opened 4 months ago by maglat
"w1_weight_scale_2 must match w3_weight_scale_2. Accuracy may be affected."
👍 1
21
#2 opened 4 months ago by zenmagnets
accuracy
26
#4 opened 4 months ago by ktsaou
Fastest for my 3090x8
2
#1 opened 4 months ago by ciprianv
Please create also Minimax 2.1 REAP versions
2
#1 opened 5 months ago by ciprianv
Report: getting 20 t/s with UD-Q4_K_XL and 72 VRAM
🔥 2
10
#2 opened 5 months ago by SlavikF
Hot Damn This Model Cooks!
👍 6
12
#5 opened 5 months ago by aaron-newsome
Please make 4 bit dwq mlx quant
2
#1 opened 5 months ago by Narutoouz
Please update llama.cpp to see improved performance!
🚀 4
4
#7 opened 6 months ago by danielhanchen
Updated Title: UDQ4_K_XL - Great Rust coder
👍 3
5
#11 opened 10 months ago by wonderfuldestruction
download link creates Q5_K_M instead of UD-Q5_K_XL named files
1
#2 opened 11 months ago by ciprianv
Confused about the eval score
❤️ 2
3
#15 opened 11 months ago by Denisssy
IQ3_KS metrics on mixed CUDA + CPU, pretty good model!
🔥 2
34
#2 opened 11 months ago by Panchovix
What are the recommended settings?
1
#7 opened 11 months ago by ciprianv
Thanks for your work! Any chance for something between Q2_K_R and Q3_K_R?
👍👀 5
19
#7 opened about 1 year ago by Panchovix
Update - Tool Calling + Chat Template bug fixes
9
#20 opened 12 months ago by danielhanchen