Any possibility of a Q6_KL?

#1 Β· opened by johnlaborxxx

Hi, wonderful model.

Just wondering if a Q6_KL GGUF is possible?
Q8 is too large and Q4 is too small, while Q6 is the sweet spot for 32 GB of VRAM. Thanks.
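As a rough back-of-envelope check (assuming, purely for illustration, a 32B-parameter model, since the parameter count isn't stated in this thread): Q6_K in llama.cpp stores about 6.5625 bits per weight, so the weights alone come to

$$32 \times 10^9 \times \frac{6.5625}{8}\ \text{bytes} \approx 26\ \text{GB},$$

which fits a 32 GB card with headroom for the KV cache, while Q8_0 at 8.5 bits per weight would be roughly 34 GB and spill over.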

Hello, llama-quantize supports Q6_K, so yes, a Q6_K quant is possible. I will do it later.
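For reference, producing that quant is a one-liner with llama.cpp's llama-quantize tool. A minimal sketch with placeholder file names, assuming a full-precision GGUF of the model is already on disk:

```sh
# Re-quantize a full-precision (F16) GGUF to Q6_K using llama.cpp's
# llama-quantize tool. File names are placeholders, not actual repo files.
./llama-quantize model-F16.gguf model-Q6_K.gguf Q6_K
```

The last argument selects the quantization type; running llama-quantize without arguments prints the full list of supported types.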

Done. Q6_KL quant uploaded. Enjoy πŸ˜€

I think this request is done πŸ˜ƒ

LuffyTheFox changed discussion status to closed

Great, downloading and testing now. Thanks for taking the time and effort!
