Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

0x05a4
/
DeepRL-PPO-LLv2

Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results
Model card Files Files and versions Community
DeepRL-PPO-LLv2
Ctrl+K
Ctrl+K
  • 1 contributor
History: 6 commits
0x05a4's picture
0x05a4
Baseline: LR=5e-4/cosine-100, epochs=1e7/305
ab2dd36 about 2 years ago
  • LunarLander-v2-PPO-305
    Baseline: LR=5e-4/cosine-100, epochs=1e7/305 about 2 years ago
  • LunarLander-v2-PPO
    Baseline: LR=3e-4/.996, epochs=2e6 about 3 years ago
  • .gitattributes
    1.22 kB
    Baseline 1M epochs about 3 years ago
  • LunarLander-v2-PPO-305.zip
    147 kB
    LFS
    Baseline: LR=5e-4/cosine-100, epochs=1e7/305 about 2 years ago
  • LunarLander-v2-PPO.zip
    146 kB
    LFS
    Baseline: LR=3e-4/.996, epochs=2e6 about 3 years ago
  • README.md
    784 Bytes
    Baseline: LR=5e-4/cosine-100, epochs=1e7/305 about 2 years ago
  • config.json
    14.4 kB
    Baseline: LR=5e-4/cosine-100, epochs=1e7/305 about 2 years ago
  • replay.mp4
    158 kB
    LFS
    Baseline: LR=5e-4/cosine-100, epochs=1e7/305 about 2 years ago
  • results.json
    157 Bytes
    Baseline: LR=5e-4/cosine-100, epochs=1e7/305 about 2 years ago