20 35 33

Max Ku

vinesmsuic

https://kuwingfung.github.io/

AI & ML interests

Computer Vision, World Models

Recent Activity

new activity 5 days ago

TIGER-Lab/ImagenWorld:Thank you.

upvoted a paper 11 days ago

Watch Before You Answer: Learning from Visually Grounded Post-Training

upvoted a paper 12 days ago

SWE-Next: Scalable Real-World Software Engineering Tasks for Agents

View all activity

Organizations

New activity in TIGER-Lab/ImagenWorld 5 days ago

Thank you.

🔥 1

#2 opened 6 months ago by

Kjay

upvoted a paper 11 days ago

Watch Before You Answer: Learning from Visually Grounded Post-Training

Paper • 2604.05117 • Published 14 days ago • 35

upvoted a paper 12 days ago

SWE-Next: Scalable Real-World Software Engineering Tasks for Agents

Paper • 2603.20691 • Published about 1 month ago • 10

authored a paper 20 days ago

ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks

Paper • 2603.27862 • Published 21 days ago • 30

upvoted a paper 20 days ago

ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks

Paper • 2603.27862 • Published 21 days ago • 30

submitted a paper to Daily Papers 20 days ago

ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks

Paper • 2603.27862 • Published 21 days ago • 30

liked a model 23 days ago

Skywork/Matrix-Game-3.0

Image-Text-to-Video • Updated 7 days ago • 200 • 110

upvoted a paper 26 days ago

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Paper • 2603.20278 • Published Mar 17 • 94

liked a model about 2 months ago

nyu-visionx/solaris

Updated Mar 4 • 10

liked a Space about 2 months ago

Qwen Image Multiple Angles 3D Camera

🎥

2.34k

Edit image camera angle with interactive 3D controls

authored a paper 2 months ago

VisPhyWorld: Probing Physical Reasoning via Code-Driven Video Reconstruction

Paper • 2602.13294 • Published Feb 9 • 13

upvoted 2 papers 2 months ago

VisPhyWorld: Probing Physical Reasoning via Code-Driven Video Reconstruction

Paper • 2602.13294 • Published Feb 9 • 13

Context Forcing: Consistent Autoregressive Video Generation with Long Context

Paper • 2602.06028 • Published Feb 5 • 36

upvoted a paper 4 months ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published Dec 1, 2025 • 74

updated 4 datasets 6 months ago

authored a paper 6 months ago

EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing

Paper • 2509.26346 • Published Sep 30, 2025 • 19

upvoted a paper 6 months ago

BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions

Paper • 2510.10666 • Published Oct 12, 2025 • 28

Max Ku

AI & ML interests

Recent Activity

Organizations

vinesmsuic's activity

Thank you.

Qwen Image Multiple Angles 3D Camera