Text-to-speech using Gradio, FastAPI, and Chatterbox TTS
Expressive Zero-shot TTS
Suggest meal menus based on occasion
Remove background from images
Describe objects in webcam feed
Generate a podcast about today's top trending paper
Use the FLUX-Pro model as much as you want.
image2mesh
Extract product images and measurements
Generate 3D models and videos from images
Translate text into different languages
Generate depth estimation maps from images
Create 3D reconstructions from videos or images
Generate music from text and melody
VGGT (CVPR 2025)
Generate music from text descriptions