Convert GUI screen to structured elements
Generate spatial audio from images (and optionally text)
Greet someone by name!
Protein, molecule & more...
Generate high-quality music from text descriptions
Display OmniParser link and instructions
Upload an image and ask questions about it
Restore degraded audio using a Transformer-based model