Zhaolin Gao

GitBag

AI & ML interests

Reinforcement Learning from Human Feedback

Recent Activity

updated a dataset about 21 hours ago
GitBag/math_1.5B_8k_eval
published a dataset about 21 hours ago
GitBag/math_1.5B_8k_eval
updated a dataset about 22 hours ago
GitBag/math_1.5B_8k

Organizations

Cornell-AGI, Cornell University

Articles 1

RLHF 101: A Technical Dive into RLHF