Generate images from text or transform images with text prompts
Process audio and generate text output based on instructions