0x05a4
/

DeepRL-PPO-LLv2

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Model card Files Files and versions Community

DeepRL-PPO-LLv2

Ctrl+K

Ctrl+K

1 contributor

History: 6 commits

0x05a4's picture

Baseline: LR=5e-4/cosine-100, epochs=1e7/305

ab2dd36 about 2 years ago

LunarLander-v2-PPO-305
Baseline: LR=5e-4/cosine-100, epochs=1e7/305 about 2 years ago
LunarLander-v2-PPO
Baseline: LR=3e-4/.996, epochs=2e6 about 3 years ago
.gitattributes

1.22 kB

Baseline 1M epochs about 3 years ago
LunarLander-v2-PPO-305.zip

147 kB
LFS

Baseline: LR=5e-4/cosine-100, epochs=1e7/305 about 2 years ago
LunarLander-v2-PPO.zip

146 kB
LFS

Baseline: LR=3e-4/.996, epochs=2e6 about 3 years ago
README.md

784 Bytes

Baseline: LR=5e-4/cosine-100, epochs=1e7/305 about 2 years ago
config.json

14.4 kB

Baseline: LR=5e-4/cosine-100, epochs=1e7/305 about 2 years ago
replay.mp4

158 kB
LFS

Baseline: LR=5e-4/cosine-100, epochs=1e7/305 about 2 years ago
results.json

157 Bytes

Baseline: LR=5e-4/cosine-100, epochs=1e7/305 about 2 years ago