Safetensors
qwen2_5_vl
yifanzhang114 commited on
Commit
6a802aa
·
verified ·
1 Parent(s): d1b5bb4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -32,10 +32,10 @@ license: apache-2.0
32
 
33
  If you find it useful for your research and applications, please cite related papers/blogs using this BibTeX:
34
  ```bibtex
35
- @article{zhang2025mm,
36
- title={MM-RLHF: The Next Step Forward in Multimodal LLM Alignment},
37
- author={Zhang, Yi-Fan and Yu, Tao and Tian, Haochen and Fu, Chaoyou and Li, Peiyan and Zeng, Jianshu and Xie, Wulin and Shi, Yang and Zhang, Huanyu and Wu, Junkang and others},
38
- journal={arXiv preprint arXiv:2502.10391},
39
  year={2025}
40
  }
41
  ```
 
32
 
33
  If you find it useful for your research and applications, please cite related papers/blogs using this BibTeX:
34
  ```bibtex
35
+ @article{zhang2025r1,
36
+ title={R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning},
37
+ author={Zhang, Yi-Fan and Lu, Xingyu and Hu, Xiao and Fu, Chaoyou and Wen, Bin and Zhang, Tianke and Liu, Changyi and Jiang, Kaiyu and Chen, Kaibing and Tang, Kaiyu and others},
38
+ journal={arXiv preprint arXiv:2505.02835},
39
  year={2025}
40
  }
41
  ```