Hugging Face
10605.0 TFLOPS
Adam Yanxiao Zhao
sdpkjc
2 followers · 9 following
https://sdpkjc.com
sdpkjc_adam
sdpkjc
yanxiao-zhao
AI & ML interests
Reinforcement Learning
Recent Activity
updated a dataset about 7 hours ago: sdpkjc/SATQuest-Base-n4_16-a1_8-92800-9280
published a dataset about 7 hours ago: sdpkjc/SATQuest-Base-n4_16-a1_8-92800-9280
updated a dataset about 10 hours ago: sdpkjc/SATQuest-Base-sat-n4_4-a1_8-3960
Organizations
Papers (2)
arxiv:2403.00673
arxiv:2402.03046
Models (98)
sdpkjc/Qwen2.5-1.5B-Instruct-FT-DPO • Text Generation • Updated Jan 22 • 28
sdpkjc/SmolLM2-FT-DPO • Text Generation • Updated Jan 22 • 11
sdpkjc/SmolLM2-FT-MyDataset • Text Generation • Updated Jan 21 • 11
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed5 • Reinforcement Learning • Updated Jan 20, 2024
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed4 • Reinforcement Learning • Updated Jan 20, 2024
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed3 • Reinforcement Learning • Updated Jan 20, 2024
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed2 • Reinforcement Learning • Updated Jan 20, 2024
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed1 • Reinforcement Learning • Updated Jan 20, 2024
sdpkjc/Humanoid-v4-ppo_fix_continuous_action-seed5 • Reinforcement Learning • Updated Jan 20, 2024
sdpkjc/Humanoid-v4-ppo_fix_continuous_action-seed4 • Reinforcement Learning • Updated Jan 20, 2024
Datasets (11)
sdpkjc/SATQuest-Base-n4_16-a1_8-92800-9280 • Viewer • Updated about 7 hours ago • 102k
sdpkjc/SATQuest-Base-sat-n4_4-a1_8-3960 • Viewer • Updated about 10 hours ago • 3.96k
sdpkjc/NumBase-N01-S2g-B2g • Viewer • Updated 26 days ago • 983k • 70
sdpkjc/NumBase-N01-S2g-B28 • Viewer • Updated 26 days ago • 459k • 63
sdpkjc/NumBase-N01-S2g-B24 • Viewer • Updated 26 days ago • 197k • 60
sdpkjc/NumBase-N01-S28-B2g • Viewer • Updated 26 days ago • 3.81k • 64
sdpkjc/NumBase-N01-S28-B28 • Viewer • Updated 26 days ago • 1.78k • 79
sdpkjc/NumBase-N01-S28-B24 • Viewer • Updated 26 days ago • 762 • 60
sdpkjc/NumBase-N01-S24-B2g • Viewer • Updated 26 days ago • 210 • 77
sdpkjc/NumBase-N01-S24-B28 • Viewer • Updated 26 days ago • 98 • 62