mlx-community/Qwen3-TTS-12Hz-1.7B-VoiceDesign-5bit

This model was converted to MLX format from Qwen/Qwen3-TTS-12Hz-1.7B-VoiceDesign using mlx-audio version 0.3.0rc1.

Refer to the original model card for more details on the model.

Use with mlx-audio

pip install -U mlx-audio

CLI Example

python -m mlx_audio.tts.generate --model mlx-community/Qwen3-TTS-12Hz-1.7B-VoiceDesign-5bit --text "Hello, this is a test." --instruct "A cheerful young female voice with high pitch and energetic tone."

Python Example

from mlx_audio.tts.utils import load_model
from mlx_audio.tts.generate import generate_audio

model = load_model("mlx-community/Qwen3-TTS-12Hz-1.7B-VoiceDesign-5bit")
generate_audio(
    model=model,
    text="Hello, this is a test.",
    instruct="A cheerful young female voice with high pitch and energetic tone.",
    file_prefix="test_audio",
)
Downloads last month
38
Safetensors
Model size
0.4B params
Tensor type
BF16
·
U32
·
MLX
Hardware compatibility
Log In to view the estimation

5-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including mlx-community/Qwen3-TTS-12Hz-1.7B-VoiceDesign-5bit