YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Created with ThunderFun's QUIP implementation, merged into regular row-wise int8. Bit of a misnomer now.

For use with:

https://github.com/BobJohnson24/ComfyUI-Flux2-INT8

https://github.com/ThunderFun/ComfyUI-Wan-INT8

Not quite as much speedup as flux2 klein 9b. 00:43<00:00, 1.76s/it (BF16) 00:27<00:00, 1.09s/it (INT8 QUIP) about 1.59x faster than bf16 on my 3090.

It was necessary to keep layers.0, layers.27,28,29 in BF16 to avoid subtle artifacting.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support