Levin Zheng
LevinZheng
0 followers · 6 following
AI & ML interests
None yet
Recent Activity
Updated a model 23 days ago: LevinZheng/rlhf_ppo_full
Updated a dataset 24 days ago: LevinZheng/aha-moment-7500
Published a dataset 24 days ago: LevinZheng/aha-moment-7500
Organizations
None yet
LevinZheng's models: 21
LevinZheng/rlhf_ppo_full • Text Generation • 8B • Updated 23 days ago • 11
LevinZheng/aha-moment-3B-v2 • 3B • Updated 24 days ago • 12
LevinZheng/edu_ppo_full_30k_150steps • 8B • Updated 25 days ago • 5
LevinZheng/edu_ppo_full_30k • 8B • Updated 25 days ago • 3
LevinZheng/edu_ppo_full_10k • 8B • Updated 25 days ago • 4
LevinZheng/edu_sft_full • Updated 27 days ago • 1
LevinZheng/edu_dpo_full • Updated 27 days ago
LevinZheng/rlhf_ppo_full-Q8_0-GGUF • 8B • Updated Jul 10 • 1
LevinZheng/ppo-lunarlander-from0 • Updated Jun 20
LevinZheng/poca-SoccerTwos • Reinforcement Learning • Updated May 28 • 5
LevinZheng/a2c-PandaReachDense-v3 • Reinforcement Learning • Updated May 28 • 2
LevinZheng/ppo-Pyramids • Reinforcement Learning • Updated May 28 • 4
LevinZheng/ppo-SnowballTarget • Reinforcement Learning • Updated May 28 • 13
LevinZheng/Reinforce-Pixelcopter-PLE-v0 • Reinforcement Learning • Updated May 27
LevinZheng/Reinforce-Cartpole-v1 • Reinforcement Learning • Updated May 27
LevinZheng/dqn-SpaceInvadersNoFrameskip-v4 • Reinforcement Learning • Updated May 27 • 4
LevinZheng/q-Taxi-v3 • Reinforcement Learning • Updated May 27
LevinZheng/ppo-LunarLander-v2 • Reinforcement Learning • Updated May 27 • 2
LevinZheng/aha-moment-3B • Text Generation • 3B • Updated May 13 • 5
LevinZheng/rlhf_dpo_full • Text Generation • 8B • Updated May 13
LevinZheng/rlhf_sft_full • Text Generation • 8B • Updated May 13