Blind vote on HF TTS models!
High-quality speech synthesis powered by Kokoro TTS
Upgraded to v1.0!
FitDiT is a high-fidelity virtual try-on model.
Detect and annotate poses in images and videos
✨[With v1.0.0] Accelerated TTS on Kokoro-82M
Extract clothing from images using a mask
Vision Transformer Attention Visualization
Generate images with virtual try-on or pose transfer
Image Super-resolution via Diffusion Inversion