Yuhao Dong PRO

THUdyh

33 151 34

AI & ML interests

None yet

Recent Activity

authored a paper 11 days ago

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

authored a paper 11 days ago

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

authored a paper 11 days ago

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

View all activity

Organizations

authored 4 papers 11 days ago

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

Paper • 2605.18984 • Published May 18 • 22

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Paper • 2605.26244 • Published May 25 • 38

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Paper • 2606.20515 • Published 21 days ago • 40

ViQ: Text-Aligned Visual Quantized Representations at Any Resolution

Paper • 2606.27313 • Published 14 days ago • 38

upvoted a paper 13 days ago

ViQ: Text-Aligned Visual Quantized Representations at Any Resolution

Paper • 2606.27313 • Published 14 days ago • 38

upvoted a paper 20 days ago

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Paper • 2606.20515 • Published 21 days ago • 40

upvoted a paper about 1 month ago

VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding

Paper • 2606.05259 • Published Jun 3 • 39

authored a paper about 1 month ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

Paper • 2605.28820 • Published May 27 • 76

upvoted 3 papers about 1 month ago

GEM: Generative Supervision Helps Embodied Intelligence

Paper • 2605.28548 • Published May 27 • 32

From Pixels to Words -- Towards Native One-Vision Models at Scale

Paper • 2605.28820 • Published May 27 • 76

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Paper • 2605.26244 • Published May 25 • 38

upvoted 3 papers about 2 months ago

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

Paper • 2605.18984 • Published May 18 • 22

Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published May 14 • 116

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Paper • 2605.12500 • Published May 12 • 194

upvoted a paper 2 months ago

Co-Evolving Policy Distillation

Paper • 2604.27083 • Published Apr 29 • 68

upvoted 2 papers 3 months ago

SWE-chat: Coding Agent Interactions From Real Users in the Wild

Paper • 2604.20779 • Published Apr 22 • 18

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published Apr 20 • 88

liked a model 3 months ago

moonshotai/Kimi-K2.6

Image-Text-to-Text • 1.1T • Updated May 19 • 1.99M • • 1.51k

liked a dataset 3 months ago

llamaindex/ParseBench

Benchmark • Updated Apr 19 • 169k • 12.8k • 101

upvoted a paper 3 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 114

Yuhao Dong PRO

AI & ML interests

Recent Activity

Organizations

THUdyh's activity