HKUST Audio's picture

HKUST Audio PRO

HKUST-Audio

·

wxue_audio

AI & ML interests

Audio Generation

Organizations

upvoted an article 7 months ago

Article

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

By

and 1 other •

Feb 11

• 32

upvoted a paper 7 months ago

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Paper • 2502.04128 • Published Feb 6 • 27

upvoted a collection 7 months ago

Llasa

TTS foundation model compatible with Llama framework (160k hours tokenized speech data released) • 11 items • Updated May 11 • 20

upvoted an article 8 months ago

Article

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

By

•

Jan 20

• 72

upvoted a paper 8 months ago

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Paper • 2408.17175 • Published Aug 30, 2024 • 5

upvoted a paper about 1 year ago

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Paper • 2404.14700 • Published Apr 23, 2024 • 33