jingyiZ00 commited on
Commit
3f150d4
·
verified ·
1 Parent(s): 4613050

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -12,8 +12,8 @@ library_name: transformers
12
  <!-- Provide a quick summary of what the model is/does. -->
13
  R1-VL-7B is a reasoning model trained with step-wise group relative policy optimization (StepGRPO).
14
 
15
- ## Paper: https://arxiv.org/pdf/2503.12937
16
 
17
- ## Github: https://github.com/jingyi0000/R1-VL
18
 
19
- ## Base model: https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct
 
12
  <!-- Provide a quick summary of what the model is/does. -->
13
  R1-VL-7B is a reasoning model trained with step-wise group relative policy optimization (StepGRPO).
14
 
15
+ ### Paper: https://arxiv.org/pdf/2503.12937
16
 
17
+ ### Github: https://github.com/jingyi0000/R1-VL
18
 
19
+ ### Base model: https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct