Generate speech, scan text, and edit images
Identify and segment objects in images
Wait for task completion