Hugging Face
Jin Zhu
mamba413
1 follower · 1 following
https://mamba413.github.io/
Mamba413
AI & ML interests
reinforcement learning
Recent Activity
Updated a dataset 9 days ago: mamba413/GenerateText_HH_Seed1
Updated a dataset 10 days ago: mamba413/GenerateText_HH_Seed1_new
Published a dataset 10 days ago: mamba413/GenerateText_HH_Seed1_new
Organizations
None yet
Models: 10
mamba413/Qwen2.5-1.5B-PPO-DR-HH-Seed1 • Updated 13 days ago • 5
mamba413/Qwen2.5-1.5B-PPO-BENCH-HH-Seed1 • Updated 13 days ago • 4
mamba413/Qwen2.5-1.5B-Reward-BENCH-HH-Seed1 • Updated 13 days ago • 5
mamba413/Qwen2.5-1.5B-Reward-BENCH-HH-Seed0 • Updated 14 days ago
mamba413/Qwen2.5-1.5B-Reward-DR-HH-Seed0 • Updated 14 days ago
mamba413/Qwen2-0.5B-Reward-DR-HH-Seed0 • Text Classification • Updated 16 days ago • 1
mamba413/Qwen2.5-1.5B-Reward-DR-IMDB-Seed0 • Updated 16 days ago
mamba413/Qwen2.5-1.5B-Reward-DR-SIMU-Seed0 • Updated 16 days ago
mamba413/Qwen2-0.5B-Reward-DR-SIMU-Seed0 • Text Classification • Updated 18 days ago • 1
mamba413/Qwen2-0.5B-Reward-DR-SIMU • Text Classification • Updated 19 days ago • 4
Datasets: 7
mamba413/GenerateText_HH_Seed1 • Viewer • Updated 9 days ago • 11.8k • 82
mamba413/GenerateText_HH_Seed1_new • Viewer • Updated 10 days ago • 640 • 33
mamba413/RewardModel-BENCH-HH-Seed1 • Viewer • Updated 11 days ago • 64 • 30
mamba413/RewardModel-DR-HH-Seed1 • Viewer • Updated 11 days ago • 64 • 26
mamba413/train_data_imdb_simu_valid • Viewer • Updated 18 days ago • 48.1k • 36
mamba413/train_data_imdb_simu • Viewer • Updated 19 days ago • 48.1k • 83
mamba413/train_data_imdb • Viewer • Updated Mar 3 • 2 • 56