P. Wang PRO
Addwater
AI & ML interests
Quantum Computing, AI
Recent Activity
upvoted a paper 21 days ago
LLM4SR: A Survey on Large Language Models for Scientific Research
upvoted a paper 21 days ago
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
upvoted a paper 21 days ago
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought
Organizations
None yet
Collections
1
models
17
Addwater/ppo-LunarLander-v2 • Reinforcement Learning
Addwater/pixelcopter-PLE-v0 • Reinforcement Learning
Addwater/LunarLander-v2-PPO • Reinforcement Learning
Addwater/Pyramids • Reinforcement Learning • 4
Addwater/cartpole-v1 • Reinforcement Learning
Addwater/rl_course_vizdoom_health_gathering_supreme • Reinforcement Learning
Addwater/a2c-PandaReachDense-v2 • Reinforcement Learning • 1
Addwater/a2c-AntBulletEnv-v0 • Reinforcement Learning • 1
Addwater/ppo-SnowballTarget • Reinforcement Learning • 9
Addwater/rl-course-unit4-PixelCopter • Reinforcement Learning
datasets
None public yet