alytts
Generate speech from text using OpenAI API
Generate speech from text using OpenAI API
Text-to-Speech, Speech-to-Text, and Language Recognition
Display interactive web applications using H2O Wave
Clone a voice using a text and audio sample
Generate audio from text using pre-trained models
Create custom voice clones using text input
Create interactive music playlists with AI assistance
Convert audio to text using a model
Generate audio effects from video using image caption
Generate voice from text with customizable audio source
A demo of MetaVoice 1B, a new TTS model by MetaVoice.
Transcribe audio to text using selected models
Convert text to speech
Run a web-based application
Convert audio to text
Convert voice to another voice
Generate or edit spoken audio from text
High-fidelity Text-To-Speech
In-browser text-to-music w/ Transformers.js!
Convert text to speech
Search... music by typing a description
Languages ru,en,zh-cn,ja,de,fr,it,pt,pl,tr,ko,nl,cs,ar,es,hu
Generate music powered by AI
Voice Chat with JARVIS
Run Whisper in Browser
Convert voice to text
Generate audio by cloning a voice
Generate speech from text using ElevenLabs voices
TTS, STT
Voice chat with AI that has web access
Clone voice to say text
Generate audio from text prompts
Transcribe or translate audio files
Generate speech from text
Generate a modified audio track and beat image from an uploaded song
Transcribe audio to text in the browser
In-browser speech recognition w/ word-level timestamps
Separate different speakers in an audio conversation
Transcribe audio to text with speaker diarization
Generate speech from text using various voices
easy download youtube audios with gradio
Interact with a multimodal chatbot using text and audio
Transform a report or document into an interview/discussion
Convert text to audio and vice versa
Generate high-quality music from text descriptions
ML-powered speech recognition directly in your browser
Convert audio to text with ease and accuracy.
Restore degraded audio using a Transformer-based model
Generate audio from text using selected character voices
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Whisper Transcribe MP3 files, use a GPU to convert faster!
An end-to-end (e2e) Voice Language Model by Fish Audio.
Vocal and background audio separator
Audio-Driven Portrait Animations
Fixed fork of the original audio sr!
base model for mono-channel completion
Generate speech from text with or without voice cloning
Generate speech from text
WebGPU text-to-Speech powered by OuteTTS and Transformers.js
Transform audio into text using a web-based model
Real-time in-browser speech recognition
Video to Audio
High-quality speech synthesis powered by Kokoro TTS
Translate and synthesize speech to English
Make Custom Voices With KokoroTTS
Zero Shot voice cloning with llasa 3b (Unofficial Demo)
Analyze music to identify genre, instrument, mood, and more
Interpret audio based on text instructions
Generate Podcast using Kokoro-TTS!
A text-to-speech model powered by SparkAudio and Mobvoi.
Blazingly Fast and Embarrassingly Simple Song Generation
Answer in speech