arxiv:2604.08546
Xin Zhou
LMD0311
AI & ML interests
None yet
Recent Activity
authored a paper about 19 hours ago
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models
authored a paper 12 days ago
Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception