Generate images in the style of a reference image
Deeply interrogate audio file content
Dense Grounded Understanding of Images and Videos
Memory-Guided Diffusion for Expressive Talking Video Generation
Audio Conditioned LipSync with Latent Diffusion Models
Blind Image Restoration with Instant Generative Reference
Generate a fictional story from an image
High-Fidelity Simultaneous Speech-To-Speech Translation
Gaze Target Estimation
Audio-Driven Portrait Animations
Audio Generation, Audio Style Transfer and Audio Inpainting
Convert an audio file to a spectrogram image
Create 3D images from stereo pairs
Extract vocals and generate lyrics from a song
Magnify subject details and enhance image quality
Generate music from audio tracks
Generate images from text prompts
Generate talking avatars from text-to-speech
Segment objects in images by selecting points
Segment and track objects in a video
Create images of a given character in different poses