Detect objects in images and get bounding boxes
Convert your face photo into anime style
Transcribe audio from microphone, file, or YouTube link
Generate personalized images with a face preservation
Generate images from text descriptions
Generate edited images with prompts
Execute commands based on environment variables
Generate high-resolution images with text prompts
perfect ocr vlm
Analyze image to generate descriptive prompt
Convert PDFs and images to Markdown and more
Generate corrected text with reference
Generate customized images using text and an ID image
CPU powered, low RTF, emotional, multilingual TTS