Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
ACloudCenter 
posted an update 20 days ago
Post
1791
I've really been into testing the various ASR, TTS, and other audio related models. This space showcases the Nvidia Canary-Qwen 2.5B model. The model is able to transcribe incredibly fast and and combine qwen for queries about the transcript.

All audio example files were generated with my adjacent VibeVoice Conference Generator Space. Another really cool model!!
ACloudCenter/canary-qwen-transcriber-2.5b

Hey! Would love to chat with you about this and the speech work you're doing! ( my LI in profile or lmk where to email you

·

Sure thing. I'll add you.

In this post