Wenkai Yang
PRO
Keven16
4 followers · 3 following
https://keven980716.github.io/
keven980716
AI & ML interests
None yet
Recent Activity
Published a model 3 days ago: Keven16/Qwen2.5-32B-TOPS-Iter-DPO-Preview
Published a model 3 days ago: Keven16/Qwen2.5-32B-TOPS-Iter-DPO
Upvoted a paper 4 days ago: Agentic Reinforced Policy Optimization
Organizations
None yet
Papers
10
arxiv:2505.00662
arxiv:2502.18080
arxiv:2406.11431
arxiv:2404.02406
models
10
Keven16/DeepCritic-7B-RL1.5-PRM800K • 8B • Updated Jun 25 • 2
Keven16/DeepCritic-7B-RL1.5-Numina • 8B • Updated Jun 23 • 3
Keven16/Qwen2.5-32B-TOPS-Iter-DPO-Preview • 33B • Updated May 15 • 1
Keven16/Qwen2.5-32B-TOPS • 33B • Updated May 15 • 3
Keven16/Qwen2.5-32B-TOPS-Iter-DPO • 33B • Updated May 15 • 1
Keven16/Qwen2.5-32B-Tag • 33B • Updated May 15 • 3
Keven16/LLaMA3.1-8B-Tag • 8B • Updated May 15 • 3
Keven16/DeepCritic-7B-RL-PRM800K • 8B • Updated May 12 • 4
Keven16/DeepCritic-7B-RL-Numina • 8B • Updated May 12 • 6
Keven16/DeepCritic-7B-SFT • 8B • Updated May 12 • 5
datasets
2
Keven16/DeepCritic-RL-Data • Viewer • Updated May 13 • 55k • 5
Keven16/DeepCritic-4.5K • Preview • Updated May 13 • 10