Nemotron-Personas Collection A collection of multilingual, region-specific synthetic persona datasets that support sovereign AI development across many countries and regions. • 5 items • Updated 12 days ago • 21
GLiNER-decoder Collection A joint encoder-decoder GLiNER model for a scalable open-ontology entity recognition • 3 items • Updated 18 days ago • 17
X-Talk: On the Underestimated Potential of Modular Speech-to-Speech Dialogue System Paper • 2512.18706 • Published Dec 21, 2025 • 1
view article Article Introducing Waypoint-1: Real-time interactive video diffusion from Overworld +3 27 days ago • 38
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents Paper • 2512.23343 • Published Dec 29, 2025 • 29
Nemotron Speech Collection Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S • 9 items • Updated 12 days ago • 38
MIDAS: Multimodal Interactive Digital-human Synthesis via Real-time Autoregressive Video Generation Paper • 2508.19320 • Published Aug 26, 2025 • 29
Sana Collection ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 22 items • Updated 27 days ago • 98
SANA-1.5 Collection SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer • 6 items • Updated Sep 13, 2025 • 10
LongLive: Real-time Interactive Long Video Generation Paper • 2509.22622 • Published Sep 26, 2025 • 188