Oleg Dats

odats

AI & ML interests

None yet

Recent Activity

commented on an article 24 days ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

updated a model about 1 month ago

odats/ppo-LunarLander-v2

published a model about 1 month ago

odats/ppo-LunarLander-v2

Organizations

None yet

odats's activity

commented on Illustrating Reinforcement Learning from Human Feedback (RLHF) 24 days ago

Maybe "Parameters frozen" should be on the initial model? (last figure caption)

updated a model about 1 month ago

odats/ppo-LunarLander-v2

Reinforcement Learning • Updated Apr 18 • 6

published a model about 1 month ago

odats/ppo-LunarLander-v2

Reinforcement Learning • Updated Apr 18 • 6