Maybe Parameters frozen should be on Initial model? (last figure caption)
Oleg Dats
odats
AI & ML interests
None yet
Recent Activity
commented on
an
article
24 days ago
Illustrating Reinforcement Learning from Human Feedback (RLHF)
updated
a model
about 1 month ago
odats/ppo-LunarLander-v2
published
a model
about 1 month ago
odats/ppo-LunarLander-v2
Organizations
None yet
odats's activity
commented on
Illustrating Reinforcement Learning from Human Feedback (RLHF)
24 days ago