OpenGVLab

community

https://github.com/opengvlab

AI & ML interests

Computer Vision

Recent Activity

ganlinyang submitted a paper about 1 hour ago

EventVLA: Event-Driven Visual Evidence Memory for Long-Horizon Vision-Language-Action Policies

qishisuren submitted a paper 9 days ago

Smaller Models are Natural Explorers for Policy-Level Diversity in GRPO

Eurayka authored a paper 13 days ago

InternVideo3: Agentify Foundation Models with Multimodal Contextual Reasoning

Papers

Imagine Before You Predict: Interleaved Latent Visual Reasoning for Video Event Prediction

RIVER: A Real-Time Interaction Benchmark for Video LLMs

OpenGVLab's models (286)

OpenGVLab/InternVL3_5-241B-A28B-MPO

Image-Text-to-Text • 241B • Updated Aug 29, 2025 • 36 • 2

OpenGVLab/InternVL3_5-241B-A28B-Pretrained

Image-Text-to-Text • 241B • Updated Aug 29, 2025 • 30 • 1

OpenGVLab/InternVL3_5-241B-A28B-Instruct

Image-Text-to-Text • 241B • Updated Aug 29, 2025 • 56.1k • 15

OpenGVLab/InternVL3_5-38B-MPO

Image-Text-to-Text • 38B • Updated Aug 29, 2025 • 74 • 2

OpenGVLab/InternVL3_5-38B-Pretrained

Image-Text-to-Text • 38B • Updated Aug 29, 2025 • 24 • 2

OpenGVLab/InternVL3_5-38B

Image-Text-to-Text • 38B • Updated Aug 29, 2025 • 12.8k • 44

OpenGVLab/InternVL3_5-30B-A3B-MPO

Image-Text-to-Text • 31B • Updated Aug 29, 2025 • 37 • 4

OpenGVLab/InternVL3_5-30B-A3B

Image-Text-to-Text • 31B • Updated Aug 29, 2025 • 92.5k • 43

OpenGVLab/InternVL3_5-30B-A3B-Pretrained

Image-Text-to-Text • 31B • Updated Aug 29, 2025 • 33 • 1

OpenGVLab/InternVL3_5-38B-Instruct

Image-Text-to-Text • 38B • Updated Aug 29, 2025 • 57k • 6

OpenGVLab/InternVL3_5-14B-MPO

Image-Text-to-Text • 15B • Updated Aug 29, 2025 • 68 • 3

OpenGVLab/InternVL3_5-14B

Image-Text-to-Text • 15B • Updated Aug 29, 2025 • 137k • 30

OpenGVLab/InternVideo2_5_Chat_8B

Video-Text-to-Text • 8B • Updated Aug 4, 2025 • 6.35k • 91

OpenGVLab/ScaleCUA_Env

Updated Jul 31, 2025 • 2

OpenGVLab/InternVideo2-Stage2_6B-224p-f4

Updated Jul 30, 2025 • 6

OpenGVLab/Mono-InternVL-2B-S1-3

Image-Text-to-Text • 3B • Updated Jul 22, 2025 • 12 • 1

OpenGVLab/Mono-InternVL-2B-S1-2

Image-Text-to-Text • 3B • Updated Jul 22, 2025 • 12 • 1

OpenGVLab/Mono-InternVL-2B-S1-1

Image-Text-to-Text • 3B • Updated Jul 22, 2025 • 8

OpenGVLab/Docopilot-8B

Image-Text-to-Text • 8B • Updated Jul 20, 2025 • 14 • 3

OpenGVLab/Docopilot-2B

Image-Text-to-Text • 2B • Updated Jul 20, 2025 • 19 • 8

OpenGVLab/ZeroGUI-OSWorld-7B

Image-Text-to-Text • 8B • Updated Jun 20, 2025 • 14 • 7

OpenGVLab/InternVideo1.0

Video Classification • Updated Jun 10, 2025 • 1

OpenGVLab/ZeroGUI-AndroidLab-7B

Image-Text-to-Text • 8B • Updated May 30, 2025 • 15 • 5

OpenGVLab/InternVL3-9B-Instruct

Image-Text-to-Text • 9B • Updated May 29, 2025 • 172 • 4

OpenGVLab/InternVL3-9B

Image-Text-to-Text • 9B • Updated May 29, 2025 • 4.27k • 25

OpenGVLab/VisualPRM-8B-v1_1

Image-Text-to-Text • 8B • Updated May 29, 2025 • 42 • 9

OpenGVLab/InternVideo2_CLIP_S

0.4B • Updated May 22, 2025 • 1.22k • 3

OpenGVLab/VideoChat-Flash-Qwen2_5-7B-1M_res224

Video-Text-to-Text • 8B • Updated May 16, 2025 • 15 • 2

OpenGVLab/InternVL_2_5_HiCo_R64

Video-Text-to-Text • 8B • Updated May 13, 2025 • 64 • 4

OpenGVLab/VisualPRM-8B

Image-Text-to-Text • 8B • Updated May 6, 2025 • 70 • 18