Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels Paper • 2405.16822 • Published May 27, 2024 • 13
SpatialVID: A Large-Scale Video Dataset with Spatial Annotations Paper • 2509.09676 • Published Sep 11, 2025 • 35
Efficiently Reconstructing Dynamic Scenes One D4RT at a Time Paper • 2512.08924 • Published Dec 9, 2025 • 19
Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation Paper • 2509.19296 • Published Sep 23, 2025 • 25
SWiT-4D: Sliding-Window Transformer for Lossless and Parameter-Free Temporal 4D Generation Paper • 2512.10860 • Published Dec 11, 2025 • 1
VoluMe -- Authentic 3D Video Calls from Live Gaussian Splat Prediction Paper • 2507.21311 • Published Jul 28, 2025 • 1
StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams Paper • 2506.08862 • Published Jun 10, 2025 • 6
PAD3R: Pose-Aware Dynamic 3D Reconstruction from Casual Videos Paper • 2509.25183 • Published Sep 29, 2025 • 3
VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning Paper • 2510.08555 • Published Oct 9, 2025 • 64
PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution Paper • 2601.10657 • Published Jan 15 • 20
Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking Paper • 2601.04720 • Published Jan 8 • 55
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning Paper • 2601.06002 • Published Jan 9 • 53
MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization Paper • 2601.01554 • Published Jan 4 • 57
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation Paper • 2601.00664 • Published Jan 2 • 56
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization Paper • 2512.24615 • Published Dec 31, 2025 • 119
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation Paper • 2601.09688 • Published Jan 14 • 126
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos Paper • 2601.00393 • Published Jan 1 • 131
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models Paper • 2512.24618 • Published Dec 31, 2025 • 151