Mingyu Chen's picture

2

Mingyu Chen

MYC081

AI & ML interests

theory

Recent Activity

updated a dataset 4 days ago

MYC081/deepscale_3b_eval_correct

published a dataset 4 days ago

MYC081/deepscale_3b_eval_correct

upvoted a paper 15 days ago

Accelerating RL for LLM Reasoning with Optimal Advantage Regression

View all activity

Organizations

None yet

MYC081's activity

upvoted a paper 15 days ago

Accelerating RL for LLM Reasoning with Optimal Advantage Regression

Paper • 2505.20686 • Published 18 days ago • 2

upvoted a paper 6 months ago

Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF

Paper • 2405.21046 • Published May 31, 2024 • 4