Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding
Paper • 2604.05015 • Published • 181
Computer Vision
RIVER: A Real-Time Interaction Benchmark for Video LLMs
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision