YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)
Created with ThunderFun's QUIP implementation, merged into regular row-wise int8. Bit of a misnomer now.
For use with:
https://github.com/BobJohnson24/ComfyUI-Flux2-INT8
https://github.com/ThunderFun/ComfyUI-Wan-INT8
Not quite as much speedup as flux2 klein 9b. 00:43<00:00, 1.76s/it (BF16) 00:27<00:00, 1.09s/it (INT8 QUIP) about 1.59x faster than bf16 on my 3090.
It was necessary to keep layers.0, layers.27,28,29 in BF16 to avoid subtle artifacting.
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support