rovo's Collections
3D Mesh
Audio
interesting
controlnet
Diffusion
Text Generation
Dataset
codellm
Diffusion LORAs
Clip Vision
Papers
Flux

Audio

updated 16 days ago

  • fishaudio/fish-speech-1.5

    Text-to-Speech • Updated Mar 25, 2025 • 6.46k • 743

  • suno/bark

    Text-to-Speech • Updated Oct 4, 2023 • 18.7k • 1.52k

  • SWivid/F5-TTS

    Text-to-Speech • Updated Mar 21, 2025 • 519k • 1.17k

  • NexaAI/OmniAudio-2.6B

    Audio-Text-to-Text • 3B • Updated Dec 13, 2024 • 1.06k • 289

  • 3DAudio-Spectrum-Analyzer

    Space • Running • 20 • One-minute creation by AI Coding Autonomous Agent • https://huggingface.co/spaces/VIDraft/mouse-webgen

  • sesame/csm-1b

    Text-to-Speech • Updated Dec 1, 2025 • 210k • 2.37k

  • argmaxinc/whisperkit-coreml

    Automatic Speech Recognition • Updated 22 days ago • 11.1M • 177