Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
15
8
Yanxiao Zhao
sdpkjc
Follow
qgallouedec's profile picture
fredericmenezes's profile picture
2 followers
·
9 following
https://sdpkjc.me
sdpkjc_adam
sdpkjc
yanxiao-zhao
AI & ML interests
Reinforcement Learning
Recent Activity
new
activity
12 days ago
xlangai/ubuntu_osworld_file_cache:
Fix update_browse_history_setup
new
activity
about 1 month ago
sdpkjc/SATQuest:
Update dataset card: Add paper link, task categories, and tags
authored
a paper
about 1 month ago
ComputerRL: Scaling End-to-End Online Reinforcement Learning for Computer Use Agents
View all activity
Organizations
sdpkjc
's models
95
Sort: Recently updated
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed5
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed4
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed3
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed2
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed1
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Humanoid-v4-ppo_fix_continuous_action-seed5
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Humanoid-v4-ppo_fix_continuous_action-seed4
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Humanoid-v4-ppo_fix_continuous_action-seed3
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Humanoid-v4-ppo_fix_continuous_action-seed2
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Humanoid-v4-ppo_fix_continuous_action-seed1
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Walker2d-v4-ppo_fix_continuous_action-seed2
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Walker2d-v4-ppo_fix_continuous_action-seed4
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Walker2d-v4-ppo_fix_continuous_action-seed5
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Walker2d-v4-ppo_fix_continuous_action-seed3
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Walker2d-v4-ppo_fix_continuous_action-seed1
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Swimmer-v4-ppo_fix_continuous_action-seed2
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/HalfCheetah-v4-ppo_fix_continuous_action-seed1
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/HalfCheetah-v4-ppo_fix_continuous_action-seed4
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/HalfCheetah-v4-ppo_fix_continuous_action-seed5
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/HalfCheetah-v4-ppo_fix_continuous_action-seed3
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Swimmer-v4-ppo_fix_continuous_action-seed3
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Swimmer-v4-ppo_fix_continuous_action-seed5
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Swimmer-v4-ppo_fix_continuous_action-seed4
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/HalfCheetah-v4-ppo_fix_continuous_action-seed2
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Swimmer-v4-ppo_fix_continuous_action-seed1
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Hopper-v4-ppo_fix_continuous_action-seed5
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Hopper-v4-ppo_fix_continuous_action-seed3
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Hopper-v4-ppo_fix_continuous_action-seed4
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Hopper-v4-ppo_fix_continuous_action-seed2
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Hopper-v4-ppo_fix_continuous_action-seed1
Reinforcement Learning
•
Updated
Jan 20, 2024
Previous
1
2
3
4
Next