Josh PRO
ACloudCenter
AI & ML interests
Real-Time AI applications, ASR, TTS, STT, and streaming media models
Recent Activity
replied to
their
post
about 21 hours ago
I've really been into testing the various ASR, TTS, and other audio related models. This space showcases the Nvidia Canary-Qwen 2.5B model. The model is able to transcribe incredibly fast and and combine qwen for queries about the transcript.
All audio example files were generated with my adjacent VibeVoice Conference Generator Space. Another really cool model!!
https://huggingface.co/spaces/ACloudCenter/canary-qwen-transcriber-2.5b
new activity
2 days ago
broadfield-dev/VibeVoice-demo-dev:ZeroGPU Timeout feedback
new activity
7 days ago
microsoft/VibeVoice-1.5B:The github repo is deleted