jingyiZ00
/

R1-VL-7B

Image-Text-to-Text

text-generation-inference

Model card Files Files and versions

jingyiZ00 commited on Mar 21

Commit

3f150d4

·

verified ·

1 Parent(s): 4613050

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -12,8 +12,8 @@ library_name: transformers
 <!-- Provide a quick summary of what the model is/does. -->
 R1-VL-7B is a reasoning model trained with step-wise group relative policy optimization (StepGRPO).
-## Paper: https://arxiv.org/pdf/2503.12937
-## Github: https://github.com/jingyi0000/R1-VL
-## Base model: https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct

 <!-- Provide a quick summary of what the model is/does. -->
 R1-VL-7B is a reasoning model trained with step-wise group relative policy optimization (StepGRPO).
+### Paper: https://arxiv.org/pdf/2503.12937
+### Github: https://github.com/jingyi0000/R1-VL
+### Base model: https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct