Training Multimodal Reward Model Through Stable Reinforcement Learning
Yi-Fan Zhang
yifanzhang114
AI & ML interests
Yi-Fan Zhang presently is a forth-year PhD student at the State Key Laboratory of Pattern Recognition, University of Chinese Academy of Sciences, under the esteemed guidance of Prof. Tieniu Tan, is dedicated to spearheading robust and reliable deep learning systems and large pretrained models.
Recent Activity
updated
a model
about 23 hours ago
yifanzhang114/Glm_highres_kwa_all_round
published
a model
about 24 hours ago
yifanzhang114/Glm_highres_kwa_all_round
updated
a model
2 days ago
yifanzhang114/GLM_kwai_ocr_count_vqa_cosistency_35k_and_claude50k_5epoch_epoch5