Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning Paper • 2601.21037 • Published 17 days ago • 15
RAVENEA Collection Collection for "RAVENEA: A Benchmark for Multimodal Retrieval-Augmented Visual Culture Understanding" • 4 items • Updated 1 day ago
Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning Paper • 2601.21037 • Published 17 days ago • 15
OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory Paper • 2512.07802 • Published Dec 8, 2025 • 46
PokemonChat: Auditing ChatGPT for Pokémon Universe Knowledge Paper • 2306.03024 • Published Jun 5, 2023 • 2
Structural Similarities Between Language Models and Neural Response Measurements Paper • 2306.01930 • Published Jun 2, 2023 • 2
RAVENEA: A Benchmark for Multimodal Retrieval-Augmented Visual Culture Understanding Paper • 2505.14462 • Published May 20, 2025 • 4
RAVENEA: A Benchmark for Multimodal Retrieval-Augmented Visual Culture Understanding Paper • 2505.14462 • Published May 20, 2025 • 4