OpenGVLab

community

https://github.com/opengvlab

opengvlab

OpenGVLab

Activity Feed Request to join this org

AI & ML interests

Computer Vision

Recent Activity

YYangzzzz authored a paper 1 day ago

OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

vansin submitted a paper 2 days ago

End-to-End Video Character Replacement without Structural Guidance

heroding77 authored a paper 3 days ago

OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

View all activity

Papers

InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision

VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs

View all Papers

OpenGVLab 's models 286

OpenGVLab/InternVL3-9B-Pretrained

Image-Text-to-Text • 9B • Updated Apr 25, 2025 • 55

OpenGVLab/InternVL2_5-8B-MPO-hf

Image-to-Text • 8B • Updated Apr 23, 2025 • 695

OpenGVLab/InternVL2_5-2B-MPO-hf

Image-to-Text • 2B • Updated Apr 23, 2025 • 4.54k

OpenGVLab/InternVL3-8B-hf

Image-Text-to-Text • 8B • Updated Apr 23, 2025 • 15.8k • 9

OpenGVLab/InternVL3-78B-hf

Image-Text-to-Text • 78B • Updated Apr 23, 2025 • 374 • 2

OpenGVLab/InternVL3-38B-hf

Image-Text-to-Text • 38B • Updated Apr 23, 2025 • 1.16k • 2

OpenGVLab/InternVL3-14B-hf

Image-Text-to-Text • 15B • Updated Apr 23, 2025 • 3.45k

OpenGVLab/InternVL3-2B-hf

Image-Text-to-Text • 2B • Updated Apr 23, 2025 • 4.94k • 3

OpenGVLab/InternVL3-1B-hf

Image-Text-to-Text • 0.9B • Updated Apr 23, 2025 • 84.4k • 10

OpenGVLab/VideoChat-R1_7B

Video-Text-to-Text • 8B • Updated Apr 22, 2025 • 183 • 8

OpenGVLab/VideoChat-R1_7B_caption

Video-Text-to-Text • 8B • Updated Apr 22, 2025 • 45 • 4

OpenGVLab/PIIP-LLaVA-Plus_ConvNeXt-L_CLIP-L_1024-336_7B

Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 25

OpenGVLab/clip-vit-large-patch14to16-224

0.4B • Updated Apr 20, 2025 • 21

OpenGVLab/PIIP-LLaVA_CLIP-BL_512-256_7B

Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 25

OpenGVLab/PIIP-LLaVA_ConvNeXt-B_CLIP-L_1024-336_7B

Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 21

OpenGVLab/PIIP-LLaVA_ConvNeXt-L_CLIP-L_1024-336_7B

Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 21

OpenGVLab/clip-vit-large-patch14to16-336

0.4B • Updated Apr 20, 2025 • 25

OpenGVLab/PIIP-LLaVA_CLIP-BL_512-448_7B

Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 23

OpenGVLab/PIIP-LLaVA_ConvNeXt-L_CLIP-L_1024-336_13B

Image-Text-to-Text • 14B • Updated Apr 20, 2025 • 24

OpenGVLab/PIIP-LLaVA_ConvNeXt-B_CLIP-L_640-224_7B

Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 26

OpenGVLab/PIIP-LLaVA_ConvNeXt-B_CLIP-L_1024-336_13B

Image-Text-to-Text • 14B • Updated Apr 20, 2025 • 22

OpenGVLab/PIIP-LLaVA_CLIP-BL_512-448_13B

Image-Text-to-Text • 14B • Updated Apr 20, 2025 • 24

OpenGVLab/InternVL3-9B-AWQ

Image-Text-to-Text • Updated Apr 17, 2025 • 54 • 1

OpenGVLab/PIIP

Object Detection • Updated Apr 16, 2025 • 5

OpenGVLab/VideoChat-R1-thinking_7B

Video-Text-to-Text • 8B • Updated Apr 13, 2025 • 32

OpenGVLab/Mini-InternVL2-2B-DA-BDD

Image-Text-to-Text • 2B • Updated Mar 26, 2025 • 62 • 1

OpenGVLab/Mini-InternVL2-2B-DA-DriveLM

Image-Text-to-Text • 2B • Updated Mar 26, 2025 • 88 • 2

OpenGVLab/Mini-InternVL2-2B-DA-Medical

Image-Text-to-Text • 2B • Updated Mar 26, 2025 • 64 • 1

OpenGVLab/InternVL2_5-26B-MPO

Image-Text-to-Text • 26B • Updated Mar 25, 2025 • 169 • 14

OpenGVLab/InternVL2_5-8B-MPO

Image-Text-to-Text • 8B • Updated Mar 25, 2025 • 209 • 48