| license: apache-2.0 | |
| tags: | |
| - trl | |
| - ddpo | |
| - diffusers | |
| - reinforcement-learning | |
| - text-to-image | |
| - stable-diffusion | |
| # TRL DDPO Model | |
| This is a diffusion model that has been fine-tuned with reinforcement learning to | |
| guide the model outputs according to a value, function, or human feedback. The model can be used for image generation conditioned with text. | |