Ai-Model
Image-Text-to-Text • 25B • Updated • 34.5k • 637
openai/whisper-large-v3-turbo
Automatic Speech Recognition • 0.8B • Updated • 2.74M • 2.8k
SWivid/F5-TTS
Text-to-Speech • Updated • 750k • 1.15k
D-Edit
84
FacePoke
2.21k • Import a portrait, click to move the head!
Expression Editor
1.6k • Quickly edit the expression of a face
F5-TTS
2.79k • F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
FLUX.1 [dev]
9.38k • Generate images from text descriptions
Face Recognition SDK
234 • Face Recognition
Open NotebookLM
1.1k • Personalised Podcasts For All - Available in 13 Languages
PMRF
315 • A Gradio demo for Posterior-Mean Rectified Flow (PMRF)
stabilityai/stable-diffusion-3.5-large
Text-to-Image • Updated • 63.3k • 3.34k
genmo/mochi-1-preview
Text-to-Video • Updated • 2.35k • 1.3k
Freepik/flux.1-lite-8B-alpha
Text-to-Image • Updated • 1.29k • 428
rhymes-ai/Allegro
Text-to-Video • Updated • 54 • 264
CohereLabs/aya-expanse-8b
Text Generation • 8B • Updated • 150k • 419
deepseek-ai/Janus-1.3B
Any-to-Any • 2B • Updated • 2.62k • 592
Pangea
50 • A Fully Open Multilingual Multimodal LLM for 39 Languages
Etched/oasis-500m
Updated • 51 • 488
microsoft/OmniParser
Image-Text-to-Text • Updated • 375 • 1.71k
OuteAI/OuteTTS-0.1-350M
Text-to-Speech • 0.4B • Updated • 5.2k • 302
tencent/Tencent-Hunyuan-Large
Text Generation • Updated • 52 • 617
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
Text Generation • 71B • Updated • 4.16k • 2.06k
tencent/HunyuanVideo
Text-to-Video • Updated • 1.34k • 2.11k
zai-org/CogVideoX-5b
Text-to-Video • Updated • 35.8k • 661
LanguageBind/Open-Sora-Plan-v1.2.0
Updated • 47
microsoft/phi-4
Text Generation • 15B • Updated • 593k • 2.21k
TRELLIS
4.78k • Scalable and Versatile 3D Generation from images
Search Your Face Online
830 • Track your online presence with reverse face search
Kolors Virtual Try-On
10k • Try on clothes on a person image
DeepSeek-R1 WebGPU
554 • Next-generation reasoning model that runs locally in-browser
AnyCoder
3.11k • Generate code snippets for web applications using AI
tencent/Hunyuan3D-2
Image-to-3D • Updated • 65.9k • 1.7k
openbmb/MiniCPM-o-2_6
Any-to-Any • 9B • Updated • 81.6k • 1.28k
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation • 71B • Updated • 317k • 740
Magic Face
244 • Transform Your Face Into Legendary Characters!
Llasa 3b Tts
313 • Zero Shot voice cloning with llasa 3b (Unofficial Demo)
mistralai/Mistral-Small-24B-Instruct-2501
24B • Updated • 766k • 950
Pyramid Flow
673 • Generate videos from text prompts and optional images
microsoft/OmniParser-v2.0
Updated • 880 • 1.31k
Zyphra/Zonos-v0.1-hybrid
Text-to-Speech • Updated • 2.88k • 1.1k
perplexity-ai/r1-1776
Text Generation • 671B • Updated • 596 • 2.33k
agentica-org/DeepScaleR-1.5B-Preview
Text Generation • 2B • Updated • 71.9k • 578
stepfun-ai/Step-Audio-Chat
Audio-Text-to-Text • 132B • Updated • 121 • 459
hexgrad/Kokoro-82M
Text-to-Speech • Updated • 2.26M • 5.63k
black-forest-labs/FLUX.1-dev
Text-to-Image • Updated • 777k • 12.2k
NousResearch/DeepHermes-3-Llama-3-8B-Preview
Text Generation • 8B • Updated • 306 • 354