Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
Mingyu Chen
MYC081
Follow
AI & ML interests
theory
Recent Activity
updated
a dataset
4 days ago
MYC081/deepscale_3b_eval_correct
published
a dataset
4 days ago
MYC081/deepscale_3b_eval_correct
upvoted
a
paper
15 days ago
Accelerating RL for LLM Reasoning with Optimal Advantage Regression
View all activity
Organizations
None yet
Papers
1
arxiv:
2505.20686
models
10
Sort: Recently updated
MYC081/SELM-Llama-3-8B-Instruct-DPO-iter-3
Updated
Feb 3
•
8
MYC081/SELM-Zephyr-7B-iter-0
Updated
Jan 16
MYC081/Qwen2.5-3B-WPO-bf16-1
Text Generation
•
Updated
Nov 15, 2024
•
19
MYC081/Qwen2.5-3B-WPO-bf16-1-test
Updated
Nov 14, 2024
MYC081/Qwen2.5-1.5B-WPO-bf16-1
Updated
Nov 14, 2024
MYC081/Qwen2-0.5B-WPO-bf16-1
Updated
Nov 14, 2024
•
8
MYC081/pythia-1b-tldr-xpo
Updated
Nov 13, 2024
•
23
MYC081/pythia-6.9b-deduped-tldr-online-dpo
Updated
Nov 11, 2024
MYC081/Qwen2.5-0.5B-Online-DPO-PairRM
Updated
Nov 5, 2024
MYC081/pythia-2.8b-deduped-tldr-online-dpo
Updated
Nov 5, 2024
datasets
3
Sort: Recently updated
MYC081/deepscale_3b_eval_correct
Viewer
•
Updated
4 days ago
•
40.3k
•
52
MYC081/math_3b_eval_gpt_correct
Viewer
•
Updated
17 days ago
•
7.5k
•
206
MYC081/math_3b_eval_correct
Viewer
•
Updated
19 days ago
•
7.5k
•
73