CoreML Speech Models
Collection
Speech AI models for Apple Neural Engine via CoreML. iOS/macOS ready. ASR, TTS, VAD, diarization. • 14 items • Updated • 1
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
CoreML models for Qwen3.5-0.8B hybrid chat LLM, optimized for Apple Neural Engine. Requires iOS 18+ / macOS 15+.
| File | Size | Description |
|---|---|---|
int8/embedding.mlpackage |
254 MB | Token embedding lookup |
int8/decoder.mlpackage |
753 MB | Full decoder (24 layers + LM head) |
import Qwen3Chat
let chat = try await Qwen35CoreMLChat.fromPretrained(quantization: .int8)
let response = try chat.generate(messages: [
ChatMessage(role: .user, content: "Hello!")
])
python scripts/convert_qwen35_chat_coreml.py --output /tmp/qwen35-coreml --quantize int8
Apache-2.0