arxiv:2105.06413
Alexey G
grib0ed0v
AI & ML interests
LLM / RLHF / AI4Everything.
Organizations
None yet
Papers
3
models
12
grib0ed0v/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
•
4
grib0ed0v/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
grib0ed0v/ppo-LunarLander-v2-unit8
Reinforcement Learning
•
Updated
grib0ed0v/poca-SoccerTwos
Reinforcement Learning
•
Updated
•
72
grib0ed0v/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
•
2
grib0ed0v/ppo-PyramidsRND
Reinforcement Learning
•
Updated
•
9
grib0ed0v/ppo-SnowballTarget
Reinforcement Learning
•
Updated
•
32
grib0ed0v/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
grib0ed0v/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
grib0ed0v/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
•
6
datasets
None public yet