Trend models for DGX Spark ✨
Collection
Build in DGX Spark, for trended models ✨ • 9 items • Updated
You can easily run this model using the DGX-Spark-llama.cpp-Bench inference engine. It's pre-configured for high-performance inference on NVIDIA hardware (especially Blackwell/DGX Spark).
docker pull ghcr.io/sowilow/dgx-spark-llama.cpp-bench:latest
For detailed configuration and usage, visit the GitHub Repository.
This repository contains GGUF-quantized weights for Qwen3.5-35B-A3B, specifically optimized for NVIDIA Blackwell (DGX Spark) hardware.
This model is a quantized version of the original Qwen/Qwen3.5-35B-A3B and is subject to the Qwen License Agreement.
By using this model, you agree to comply with Alibaba Cloud / Qwen's licensing terms.
qwen3.5-35b-a3b-q4_k_m.gguf: Main model weights.qwen3.5-2b-mmproj-f16.gguf: Multimodal vision projector.Created using DGX-Spark-llama.cpp-Bench
4-bit