HuanjinYao
/

R1-ShareVL-7B

Image-Text-to-Text

Model card Files Files and versions

Mulberry

R1-ShareVL-7b is a reasoning MLLM trained by ShareGRPO.

Paper: https://arxiv.org/abs/2505.16673

Code: https://github.com/HJYao00/R1-ShareVL

More Details

Base Model: https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct

Training Framework: EasyR1

Hardware: 8x NVIDIA H100

Downloads last month: 18

Safetensors

Model size

8B params

Tensor type

BF16

·

Model tree for HuanjinYao/R1-ShareVL-7B

Base model

Qwen/Qwen2.5-VL-7B-Instruct

Finetuned

(906)

this model

Quantizations

Collection including HuanjinYao/R1-ShareVL-7B

R1-ShareVL

3 items • Updated Jul 16 • 1