neuphonic/neutts-air
Text-to-Speech • 0.7B • Updated • 10.5k • 848
State-of-the-art target speech extractor
Extreme Super-Resolution via Scale Autoregression
Generate large 3D models using spatial sparse attention
Watch and experiment with real-time AIs with visuals
Voice Activity Detection using MarbleNet model
Filter multilingual data for high-quality language models
Transcribe speech and highlight emphasized words
Generate engaging educational videos