File size: 1,651 Bytes
			
			05cfc76 764d5ad 05cfc76 431dae7 764d5ad 431dae7 d341237 520adc6 d341237 431dae7 764d5ad 431dae7 520adc6 431dae7 520adc6 431dae7  | 
								1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75  | 
								---
license: mit
library_name: pytorch
pipeline_tag: reinforcement-learning
language:
  - en
tags:
  - reinforcement-learning
  - gymnasium
  - CartPole-v1
  - ppo
  - pytorch
model-index:
  - name: CartPole-v1_ppo
    results:
      - task:
          type: reinforcement-learning
          name: Reinforcement Learning
        dataset:
          name: CartPole-v1
          type: gymnasium
        metrics:
          - name: Best Eval Reward
            type: reward
            value: 272.3999938964844
          - name: Current Eval Reward
            type: reward
            value: 500.0
          - name: Epoch
            type: epoch
            value: 199.0
          - name: Total Timesteps
            type: timesteps
            value: 0.0
---
# CartPole-v1_ppo
Run: `cvb5lyfw` — Env: `CartPole-v1` — Algo: `ppo`
This repository contains artifacts from a Gymnasium Solver training run.
## Contents
- Config: `artifacts/configs/config.json`
- Checkpoints: `artifacts/checkpoints/*.ckpt`
- Logs: `artifacts/logs/*.log`
- Video: `artifacts/videos/**/best_checkpoint.mp4` (also previewed below)
## Preview
<video controls src="preview.mp4" width="480"></video>
If the video above doesn't load, try the fallback: [replay.mp4](replay.mp4)
## Config (excerpt)
```json
{
  "env_id": "CartPole-v1",
  "algo_id": "ppo",
  "n_steps": 32,
  "batch_size": 256,
  "n_epochs": 20,
  "n_timesteps": 100000.0,
  "seed": 42,
  "n_envs": 8,
  "obs_type": "rgb",
  "policy": "MlpPolicy",
  "learning_rate": 0.001,
  "gamma": 0.98,
  "gae_lambda": 0.8,
  "ent_coef": 0.0,
  "vf_coef": 0.5,
  "clip_range": 0.2,
  "normalize_advantages": "batch"
}
``` |