Mingyu Chen's picture

2

Mingyu Chen

MYC081

AI & ML interests

theory

Recent Activity

updated a dataset 4 days ago

MYC081/deepscale_3b_eval_correct

published a dataset 4 days ago

MYC081/deepscale_3b_eval_correct

upvoted a paper 15 days ago

Accelerating RL for LLM Reasoning with Optimal Advantage Regression

View all activity

Organizations

None yet

Papers 1

arxiv:2505.20686

models 10

MYC081/SELM-Llama-3-8B-Instruct-DPO-iter-3

Updated Feb 3 • 8

MYC081/SELM-Zephyr-7B-iter-0

MYC081/Qwen2.5-3B-WPO-bf16-1

Text Generation • Updated Nov 15, 2024 • 19

MYC081/Qwen2.5-3B-WPO-bf16-1-test

Updated Nov 14, 2024

MYC081/Qwen2.5-1.5B-WPO-bf16-1

Updated Nov 14, 2024

MYC081/Qwen2-0.5B-WPO-bf16-1

Updated Nov 14, 2024 • 8

MYC081/pythia-1b-tldr-xpo

Updated Nov 13, 2024 • 23

MYC081/pythia-6.9b-deduped-tldr-online-dpo

Updated Nov 11, 2024

MYC081/Qwen2.5-0.5B-Online-DPO-PairRM

Updated Nov 5, 2024

MYC081/pythia-2.8b-deduped-tldr-online-dpo

Updated Nov 5, 2024

datasets 3

MYC081/deepscale_3b_eval_correct

Viewer • Updated 4 days ago • 40.3k • 52

MYC081/math_3b_eval_gpt_correct

Viewer • Updated 17 days ago • 7.5k • 206

MYC081/math_3b_eval_correct

Viewer • Updated 19 days ago • 7.5k • 73