Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
YSKang 's Collections
3D
Documents
Generative
On-Device
LLM
Audio

Audio

updated Jul 8, 2024
Upvote
-

  • Music Consistency Models

    Paper • 2404.13358 • Published Apr 20, 2024 • 14

  • PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation

    Paper • 2407.02869 • Published Jul 3, 2024 • 21

  • LiveSpeech: Low-Latency Zero-shot Text-to-Speech via Autoregressive Modeling of Audio Discrete Codes

    Paper • 2406.02897 • Published Jun 5, 2024 • 16

  • Audio Mamba: Bidirectional State Space Model for Audio Representation Learning

    Paper • 2406.03344 • Published Jun 5, 2024 • 21
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs