
Karan Thakkar

carankt

AI & ML interests

None yet

Recent Activity

authored a paper 7 days ago
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer
authored a paper 7 days ago
CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech
upvoted a paper 7 days ago
CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech

Organizations

Johns Hopkins University

carankt's activity

authored 2 papers 7 days ago

SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer

Paper • 2409.08425 • Published Sep 12, 2024 • 10

CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech

Paper • 2506.02863 • Published 11 days ago • 7
upvoted a paper 7 days ago

CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech

Paper • 2506.02863 • Published 11 days ago • 7
liked a model 7 days ago

OpenSound/CapSpeech-models

Updated 10 days ago • 5
liked a Space 7 days ago
Running on Zero
55

CapSpeech TTS

🧢

Stylized TTS – design voice, accent, and emotion your way

updated a dataset 9 months ago

carankt/DreamVoiceDB

Preview • Updated Sep 20, 2024 • 25
liked a dataset over 1 year ago

CSTR-Edinburgh/vctk

Updated Aug 14, 2024 • 547 • 43