⭐ Powered by FunASR — please give us a GitHub Star!

This model is part of the FunASR ecosystem, an industrial-grade open-source toolkit for ASR, VAD, punctuation restoration, speaker diarization, emotion and event detection, and LLM-based ASR. A Star really helps the project (and keeps you updated):

🌟 FunASR · 🌟 SenseVoice · 🌟 Fun-ASR · 🌟 FunClip

Fun-ASR-Nano (HuggingFace Transformers)

This is the HuggingFace Transformers-compatible version of Fun-ASR-Nano-2512.

Fun-ASR-Nano is an end-to-end speech recognition model by FunAudioLLM, trained on tens of millions of hours of real speech data. It supports multilingual speech recognition covering Chinese (with dialects), English, Japanese, Korean, and many more languages.

For full documentation, benchmarks, and usage instructions, please refer to the main model card.

Format: Safetensors · Model size: 1.0B params · Tensor type: BF16
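As a rough guide to what the size and tensor type above imply for memory, here is a minimal back-of-the-envelope sketch. It assumes the rounded figure of 1.0B parameters and counts only the weights; activations, audio features, and any decoding cache are not included, so real usage will be higher. The helper name `weight_memory_gib` is hypothetical and not part of FunASR or Transformers.

```python
def weight_memory_gib(num_params: float, bytes_per_param: int) -> float:
    """Approximate memory needed for the weights alone, in GiB."""
    return num_params * bytes_per_param / (1024 ** 3)


NUM_PARAMS = 1.0e9  # "1.0B params" from the model card (rounded, an assumption)
BF16_BYTES = 2      # bfloat16 stores each value in 16 bits = 2 bytes
FP32_BYTES = 4      # float32 for comparison

bf16_gib = weight_memory_gib(NUM_PARAMS, BF16_BYTES)
fp32_gib = weight_memory_gib(NUM_PARAMS, FP32_BYTES)

print(f"BF16 weights: ~{bf16_gib:.2f} GiB")
print(f"FP32 weights: ~{fp32_gib:.2f} GiB")
```

Under these assumptions the BF16 checkpoint needs a little under 2 GiB for weights, about half of what the same parameter count would take in FP32.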