-
HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
Paper • 2310.14566 • Published • 27 -
TouchStone: Evaluating Vision-Language Models by Language Models
Paper • 2308.16890 • Published • 1
donghunlee
hundong2
AI & ML interests
None yet
Recent Activity
upvoted a paper 4 days ago
A decoder-only foundation model for time-series forecasting upvoted a paper about 1 month ago
TT4D: A Pipeline and Dataset for Table Tennis 4D Reconstruction From Monocular Videos upvoted a paper 5 months ago
Advancing Open-source World ModelsOrganizations
None yet