Shengyi Costa Huang

vwxyzjn

AI & ML interests

None yet

Recent Activity

updated a dataset about 24 hours ago
ai2-adapt-dev/rlvr_open_reasoner_math
published a dataset about 24 hours ago
ai2-adapt-dev/rlvr_open_reasoner_math
updated a dataset about 24 hours ago
ai2-adapt-dev/rlvr_gsm8k_zs
View all activity

Organizations

Ai2's profile picture cleanrl's profile picture lm-human-preference-details's profile picture ICML2023's profile picture Brrr Gang's profile picture AI2 Adapt Dev's profile picture Dev Mode Explorers's profile picture OLMoE's profile picture

vwxyzjn's activity

published an article 8 months ago
view article
Article

How NuminaMath Won the 1st AIMO Progress Prize

116
published an article 8 months ago
view article
Article

Preference Optimization for Vision Language Models

59
published an article 8 months ago
view article
Article

Putting RL back in RLHF

75
published an article about 1 year ago
view article
Article

Constitutional AI with Open LLMs

13
published an article over 1 year ago
view article
Article

The N Implementation Details of RLHF with PPO

39