@ACloudCenter on Hugging Face: "I've really been into testing the various ASR, TTS, and other audio related…"

posted an update Sep 4, 2025

I've really been into testing various ASR, TTS, and other audio-related models. This Space showcases the Nvidia Canary-Qwen 2.5B model. The model transcribes incredibly fast and combines with Qwen to answer queries about the transcript.

All audio example files were generated with my adjacent VibeVoice Conference Generator Space. Another really cool model!!
ACloudCenter/canary-qwen-transcriber-2.5b

johnnyb0y

Sep 10, 2025

Hey! Would love to chat with you about this and the speech work you're doing! (My LI is in my profile, or lmk where to email you.)

ACloudCenter

Sep 11, 2025

Sure thing. I'll add you.

In this post

ACloudCenter Josh
johnnyb0y Joh Bo