Josh's picture

5 10

Josh PRO

ACloudCenter

·

AI & ML interests

Real-Time AI applications, ASR, TTS, STT, and streaming media models

Recent Activity

replied to their post 2 days ago

I've really been into testing the various ASR, TTS, and other audio related models. This space showcases the Nvidia Canary-Qwen 2.5B model. The model is able to transcribe incredibly fast and and combine qwen for queries about the transcript. All audio example files were generated with my adjacent VibeVoice Conference Generator Space. Another really cool model!! https://huggingface.co/spaces/ACloudCenter/canary-qwen-transcriber-2.5b

new activity 4 days ago

broadfield-dev/VibeVoice-demo-dev:ZeroGPU Timeout feedback

new activity 9 days ago

microsoft/VibeVoice-1.5B:The github repo is deleted

View all activity

Organizations

ACloudCenter 's Spaces 2

Running on Zero

Canary Qwen Transcriber 2.5b

Transcribe audio and ask questions about the transcript

Running on Zero

ACE Step

A Step Towards Music Generation Foundation Model