Hi! Your NVFP4 quantizations are impressive. Could you prepare a DAQ NVFP4 of cerebras/GLM-4.7-Flash-REAP-23B-A3B? It is the best model so far that I can fit on my RTX 5080, and no one else quantizes at as high a quality as you do.