veg

ciprianv

·

AI & ML interests

None yet

Recent Activity

liked a model about 21 hours ago

vessl/Kimi-K3-W4AFP8

liked a model 6 days ago

baseten/GLM-5.2-Vision-NVFP4

liked a model 12 days ago

UCloud-org/GLM-5.2-FP8-DFlash

View all activity

Organizations

None yet

New activity in XiaomiMiMo/MiMo-V2.5-Pro-FP4-DFlash about 1 month ago

Vllm config

#6 opened about 1 month ago by

New activity in z-lab/GLM-5.1-FP8-DFlash about 1 month ago

sm120 vllm

#2 opened about 2 months ago by

New activity in INC4AI/MiMo-V2.5-Pro-int4-mixed 2 months ago

working vllm or sglang command

#1 opened 2 months ago by

New activity in lukealonso/MiMo-V2.5-NVFP4 2 months ago

Looping in OpenCode

#4 opened 3 months ago by

New activity in Qwen/Qwen3.5-397B-A17B-GPTQ-Int4 5 months ago

GPTQ vs Q4 GGUF

#2 opened 5 months ago by

New activity in mratsim/MiniMax-M2.5-BF16-INT4-AWQ 5 months ago

Cant get it to work on 8x RTX3090

#1 opened 6 months ago by

New activity in lukealonso/MiniMax-M2.5-NVFP4 5 months ago

"w1_weight_scale_2 must match w3_weight_scale_2. Accuracy may be affected."

#2 opened 6 months ago by

New activity in mratsim/MiniMax-M2.5-BF16-INT4-AWQ 5 months ago

accuracy

#4 opened 6 months ago by

New activity in mratsim/MiniMax-M2.1-BF16-INT4-AWQ 6 months ago

Fastest for my 3090x8

#1 opened 6 months ago by

New activity in 0xSero/MiniMax-M2.1-139B 7 months ago

Hey i like the model could you maybe make a NVFP4 version or a version optimised for the dgx spark?

#1 opened 7 months ago by

New activity in cerebras/GLM-4.7-REAP-218B-A32B 7 months ago

Please create also Minimax 2.1 REAP versions

#1 opened 7 months ago by

New activity in unsloth/MiniMax-M2.1-GGUF 7 months ago

Report: getting 20 t/s with UD-Q4_K_XL and 72 VRAM

#2 opened 7 months ago by

Hot Damn This Model Cooks!

#5 opened 7 months ago by

New activity in MiniMaxAI/MiniMax-M2.1 7 months ago

Please make 4 bit dwq mlx quant

#1 opened 7 months ago by

New activity in unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF 8 months ago

Please update llama.cpp to see improved performance!

#7 opened 8 months ago by

New activity in unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF 12 months ago

Updated Title: UDQ4_K_XL - Great Rust coder

#11 opened 12 months ago by

wonderfuldestruction

New activity in unsloth/Qwen3-235B-A22B-Thinking-2507-GGUF about 1 year ago

download link creates Q5_K_M instead of UD-Q5_K_XL named files

#2 opened about 1 year ago by

New activity in Qwen/Qwen3-Coder-480B-A35B-Instruct about 1 year ago

Confused about the eval score

#15 opened about 1 year ago by

New activity in ubergarm/DeepSeek-TNG-R1T2-Chimera-GGUF about 1 year ago

IQ3_KS metrics on mixed CUDA + CPU, pretty good model!

#2 opened about 1 year ago by

New activity in tngtech/DeepSeek-TNG-R1T2-Chimera about 1 year ago

What are the recommended settings?

#7 opened about 1 year ago by