LLaVA-UHD v4: What Makes Efficient Visual Encoding in MLLMs? Paper • 2605.08985 • Published 15 days ago • 21
GestaltLabs/Qwen3.6-35B-A3B-NSC-ACE-SABER-GGUF Image-Text-to-Text • 35B • Updated 9 days ago • 2.11k • 3
WebWorld: A Large-Scale World Model for Web Agent Training Paper • 2602.14721 • Published Feb 16 • 18