Audio Conditioned LipSync with Latent Diffusion Models
Colorize black-and-white images with captions
Generate images using a prompt and input image
Generate high-quality images from prompts and input images
Generate realistic images from text prompts and images