AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation Paper • 2604.08540 • Published Apr 9 • 5
Running on CPU Upgrade Agents Featured 111 Cohere Multilingual ASR 🎙 111 Transcribe audio clips to text in many languages
Running on Zero MCP 2.64k Wan2.2 14B Preview 🐌 2.64k generate a video from an image with a text prompt
Running on T4 Agents Featured 80 Trackers 🔥 80 Track objects in your video and get an annotated result