tsilva
/

CartPole-v1_ppo

Reinforcement Learning

Model card Files Files and versions

CartPole-v1_ppo

Run: cvb5lyfw — Env: CartPole-v1 — Algo: ppo

This repository contains artifacts from a Gymnasium Solver training run.

Contents

Config: artifacts/configs/config.json
Checkpoints: artifacts/checkpoints/*.ckpt
Logs: artifacts/logs/*.log
Video: artifacts/videos/**/best_checkpoint.mp4 (also previewed below)

Preview

If the video above doesn't load, try the fallback: replay.mp4

Config (excerpt)

{
  "env_id": "CartPole-v1",
  "algo_id": "ppo",
  "n_steps": 32,
  "batch_size": 256,
  "n_epochs": 20,
  "n_timesteps": 100000.0,
  "seed": 42,
  "n_envs": 8,
  "obs_type": "rgb",
  "policy": "MlpPolicy",
  "learning_rate": 0.001,
  "gamma": 0.98,
  "gae_lambda": 0.8,
  "ent_coef": 0.0,
  "vf_coef": 0.5,
  "clip_range": 0.2,
  "normalize_advantages": "batch"
}

Downloads last month: -; Downloads are not tracked for this model. How to track

Video Preview

Reinforcement Learning

loading

Evaluation results

Best Eval Reward on CartPole-v1
self-reported

272.400
Current Eval Reward on CartPole-v1
self-reported

500.000
Epoch on CartPole-v1
self-reported

199.000
Total Timesteps on CartPole-v1
self-reported

0.000

View on Papers With Code