SeanForHim
/

ddpo-finetuned-stable-diffusion

StableDiffusionPipeline

reinforcement-learning

stable-diffusion

Model card Files Files and versions

ddpo-finetuned-stable-diffusion / README.md

SeanForHim's picture

Push model using huggingface_hub.

0d9d17b verified over 1 year ago

|

history blame contribute delete

363 Bytes

	---
	license: apache-2.0
	tags:
	- trl
	- ddpo
	- diffusers
	- reinforcement-learning
	- text-to-image
	- stable-diffusion
	---

	# TRL DDPO Model

	This is a diffusion model that has been fine-tuned with reinforcement learning to
	guide the model outputs according to a value, function, or human feedback. The model can be used for image generation conditioned with text.