VIBE

community

AI & ML interests

None defined yet.

Recent Activity

yifanzhang114 authored a paper 9 days ago

VisBrowse-Bench: Benchmarking Visual-Native Search for Multimodal Browsing Agents

yifanzhang114 authored a paper 9 days ago

Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis

yifanzhang114 authored a paper 9 days ago

Agentic-MME: What Agentic Capability Really Brings to Multimodal Intelligence?

View all activity

authored 4 papers 9 days ago

VisBrowse-Bench: Benchmarking Visual-Native Search for Multimodal Browsing Agents

Paper • 2603.16289 • Published about 1 month ago

Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis

Paper • 2603.29620 • Published 17 days ago • 46

Agentic-MME: What Agentic Capability Really Brings to Multimodal Intelligence?

Paper • 2604.03016 • Published 14 days ago • 37

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published 11 days ago • 232

authored 2 papers 29 days ago

PlannerRFT: Reinforcing Diffusion Planners through Closed-Loop and Sample-Efficient Fine-Tuning

Paper • 2601.12901 • Published Jan 19

How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing

Paper • 2602.01851 • Published Feb 2 • 16

submitted a paper to Daily Papers 2 months ago

Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning

Paper • 2601.21037 • Published Jan 28 • 15

submitted a paper to Daily Papers 2 months ago

How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing

Paper • 2602.01851 • Published Feb 2 • 16

updated 12 datasets 2 months ago

VIBE-Benchmark/VIBE-Benchmark

Viewer • Updated Feb 2 • 2.65k • 360 • 3

VIBE-Benchmark/VIBE-Seedream4.0

Viewer • Updated Feb 1 • 1.03k • 42

VIBE-Benchmark/VIBE-Seedream4.5

Viewer • Updated Feb 1 • 1.03k • 26

VIBE-Benchmark/OmniGen

Viewer • Updated Feb 1 • 1.03k • 10

VIBE-Benchmark/VIBE-Banana-Flash

Viewer • Updated Feb 1 • 1.01k • 9

VIBE-Benchmark/VIBE-GPT-Image

Viewer • Updated Feb 1 • 1.01k • 9

VIBE-Benchmark/Edit-R1-Qwen-Image-Edit-2509

Viewer • Updated Feb 1 • 1.03k • 16

VIBE-Benchmark/Qwen-Image-Edit-2509

Viewer • Updated Feb 1 • 1.03k • 11

VIBE-Benchmark/VIBE-Qwen-Image-Edit

Viewer • Updated Feb 1 • 934 • 10

VIBE-Benchmark/FLUX2-dev

Viewer • Updated Feb 1 • 1.03k • 22

VIBE-Benchmark/OmniGen2

Viewer • Updated Feb 1 • 1.03k • 4 • 1

VIBE-Benchmark/UniWorld-V1

Viewer • Updated Feb 1 • 1.03k • 6