microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 7 days ago • 800k • 1.3k
Post
We did it. Kokoro TTS (v1.0) can now run 100% locally in your browser with WebGPU acceleration. Real-time text-to-speech without a server. ⚡️
Generate 10 seconds of speech in ~1 second for $0.
What will you build? 🔥 webml-community/kokoro-webgpu
The most difficult part was getting the model running in the first place, but the next steps are simple:
Implement sentence splitting, allowing for streamed responses
Multilingual support (only phonemization left)
Who wants to help?
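The sentence-splitting step listed in the post could look roughly like the sketch below. It is not taken from the kokoro-webgpu code: `splitSentences` is a hypothetical helper, and the regex is a naive assumption that treats every `.`, `!`, or `?` as a sentence end (so it will mis-split abbreviations and decimals such as "3.14"). The idea is only that each returned chunk can be handed to the synthesizer separately, so playback can start before the whole text has been processed.

```typescript
// Hypothetical helper (not part of Kokoro): split text into sentence-sized
// chunks so each one can be synthesized and played as soon as it is ready.
function splitSentences(text: string): string[] {
  // Either a run of non-terminators followed by one or more terminators,
  // or a trailing fragment that has no terminator at all.
  const matches = text.match(/[^.!?]+[.!?]+|[^.!?]+$/g) ?? [];
  return matches
    .map((sentence) => sentence.trim())
    .filter((sentence) => sentence.length > 0);
}

// Usage: each element would be passed to the TTS model in turn.
console.log(splitSentences("Hello world. How are you? Fine!"));
// prints [ 'Hello world.', 'How are you?', 'Fine!' ]
```

A production version would more likely rely on a locale-aware segmenter than on a regex, which also matters for the multilingual support mentioned in the post.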
Running • 302 • Kokoro Text-to-Speech (WebGPU) 🗣 High-quality speech synthesis powered by Kokoro TTS