Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Wenkai Yang's picture
2 7

Wenkai Yang PRO

Keven16
dongguanting's profile picture AronYang's profile picture wyzjack's profile picture
·
https://keven980716.github.io/
  • keven980716

AI & ML interests

None yet

Recent Activity

published a model 3 days ago
Keven16/Qwen2.5-32B-TOPS-Iter-DPO-Preview
published a model 3 days ago
Keven16/Qwen2.5-32B-TOPS-Iter-DPO
upvoted a paper 4 days ago
Agentic Reinforced Policy Optimization
View all activity

Organizations

None yet

Papers 10

arxiv:2505.00662
arxiv:2502.18080
arxiv:2406.11431
arxiv:2404.02406

models 10

Keven16/DeepCritic-7B-RL1.5-PRM800K

8B • Updated Jun 25 • 2

Keven16/DeepCritic-7B-RL1.5-Numina

8B • Updated Jun 23 • 3

Keven16/Qwen2.5-32B-TOPS-Iter-DPO-Preview

33B • Updated May 15 • 1

Keven16/Qwen2.5-32B-TOPS

33B • Updated May 15 • 3

Keven16/Qwen2.5-32B-TOPS-Iter-DPO

33B • Updated May 15 • 1

Keven16/Qwen2.5-32B-Tag

33B • Updated May 15 • 3

Keven16/LLaMA3.1-8B-Tag

8B • Updated May 15 • 3

Keven16/DeepCritic-7B-RL-PRM800K

8B • Updated May 12 • 4

Keven16/DeepCritic-7B-RL-Numina

8B • Updated May 12 • 6

Keven16/DeepCritic-7B-SFT

8B • Updated May 12 • 5

datasets 2

Keven16/DeepCritic-RL-Data

Viewer • Updated May 13 • 55k • 5

Keven16/DeepCritic-4.5K

Preview • Updated May 13 • 10
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs