
Josh PRO

ACloudCenter

AI & ML interests

Real-Time AI applications, ASR, TTS, STT, and streaming media models

Recent Activity

replied to their post 2 days ago
I've really been into testing various ASR, TTS, and other audio-related models. This Space showcases the Nvidia Canary-Qwen 2.5B model. The model transcribes incredibly fast and uses Qwen to answer queries about the transcript. All example audio files were generated with my adjacent VibeVoice Conference Generator Space. Another really cool model! https://huggingface.co/spaces/ACloudCenter/canary-qwen-transcriber-2.5b
new activity 4 days ago
broadfield-dev/VibeVoice-demo-dev:ZeroGPU Timeout feedback
new activity 9 days ago
microsoft/VibeVoice-1.5B:The github repo is deleted

Organizations

A Cloud Center

ACloudCenter's Spaces (2)

Running on Zero
3

Canary Qwen Transcriber 2.5b

📝

Transcribe audio and ask questions about the transcript

9 days ago
Running on Zero
1

ACE Step

😻

A Step Towards Music Generation Foundation Model

12 days ago