Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC Paper • 2505.24200 • Published May 30, 2025
Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence Paper • 2604.24954 • Published 10 days ago • 19
view post Post 3233 Built a small site for tracking speech-to-speech, full-duplex, and audio foundation model work.It covers models, benchmarks, datasets, and some blog posts to organize the landscape in one place.Still early, but sharing in case it is useful:https://www.fullduplex.ai/If you spot missing entries or mistakes, I would really appreciate corrections. See translation 2 replies · 🔥 3 3 + Reply