-
HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
Paper • 2310.14566 • Published • 27 -
TouchStone: Evaluating Vision-Language Models by Language Models
Paper • 2308.16890 • Published • 1
donghunlee
hundong2
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
Advancing Open-source World Models liked
a model about 2 months ago
nvidia/nemotron-speech-streaming-en-0.6b upvoted a collection about 2 months ago
Falcon-H1R Organizations
None yet