Towards Fine-Grained Multi-Dimensional Speech Understanding: Data Pipeline, Benchmark, and Model
ASLP-lab
ASLP-lab
AI & ML interests
None yet
Recent Activity
updated a dataset about 11 hours ago
ASLP-lab/UrduSpeech liked a model about 23 hours ago
Soul-AILab/SoulX-Transcriber liked a dataset 7 days ago
ASLP-lab/HumDial-EIBenchOrganizations
None yet
spaces 8
Configuration error
Agents
8
YingMusic-Singer-Plus
🎤
Edit lyrics, keep the melody
Runtime error
Agents
12
WenetSpeech Yue
🔥
Large-Scale Cantonese Speech Corpus
Runtime error
Agents
1
VoiceSculptor
📚
Running on Zero
Agents
44
DiffRhythm2
🎵
Generate a full song from lyrics and style prompts
Configuration error
Agents
22
SongFormer
🎵
State-of-the-art music analysis with multi-scale datasets
Running on Zero
Agents
Featured
688
Di♪♪Rhythm
🎶
Blazingly Fast and Embarrassingly Simple Song Generation
models 35
ASLP-lab/FM-Speech
Audio Classification • Updated • 2
ASLP-lab/SongFormer
0.7B • Updated • 418 • 17
ASLP-lab/Speaker-Reasoner
32B • Updated • 22 • 2
ASLP-lab/Speaker-Reasoner-4194h
32B • Updated • 124 • 1
ASLP-lab/YingMusic-Singer-Plus
0.7B • Updated • 1.28k • 7
ASLP-lab/OmniCodec
Feature Extraction • Updated • 1
ASLP-lab/OSUM-Pangu
Audio-to-Audio • Updated • 2
ASLP-lab/VoiceSculptor-VD
Text-to-Speech • 4B • Updated • 26 • 18
ASLP-lab/WenetSpeech-Wu-Speech-Understanding
Updated • 2
ASLP-lab/WenetSpeech-Wu-Speech-Generation
Text-to-Speech • Updated • 3
datasets 21
ASLP-lab/UrduSpeech
Viewer • Updated • 73.4k • 14k
ASLP-lab/HumDial-EIBench
Viewer • Updated • 1 • 1.91k • 2
ASLP-lab/FMSU-Bench
Updated • 95 • 1
ASLP-lab/FastTurn-Testset
Updated • 96
ASLP-lab/SongFormDB
Updated • 7.61k • 8
ASLP-lab/SongFormBench
Viewer • Updated • 3.82k • 613 • 3
ASLP-lab/HumDial-FDBench
Updated • 211 • 3
ASLP-lab/WSC-Train
Preview • Updated • 391 • 125
ASLP-lab/LyricEditBench
Viewer • Updated • 7.2k • 517 • 2
ASLP-lab/WenetSpeech-Wu-Bench
Viewer • Updated • 242 • 403 • 4