Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
OliP 's Collections
NewGen small LMs
Leading Leaderboards
2024 Papers of the year
2023 (and before) Papers of the Year
LLM Deployment
Vision-Language
Long-Context
Audio
Special LMs <10B
🌶️ Spaces
Evaluation
Applications
Coding

Audio

updated Dec 19, 2024
Upvote
-

  • Stable Audio Open

    Paper • 2407.14358 • Published Jul 19, 2024 • 27

  • Qwen2-Audio Technical Report

    Paper • 2407.10759 • Published Jul 15, 2024 • 60

  • kyutai/moshiko-pytorch-bf16

    Updated Sep 18, 2024 • 165k • 176

  • Presto! Distilling Steps and Layers for Accelerating Music Generation

    Paper • 2410.05167 • Published Oct 7, 2024 • 18

  • OuteAI/OuteTTS-0.1-350M

    Text-to-Speech • Updated about 1 month ago • 238 • 301

  • Foundation Models for Music: A Survey

    Paper • 2408.14340 • Published Aug 26, 2024 • 45

  • fishaudio/fish-speech-1.5

    Text-to-Speech • Updated Mar 25 • 8.31k • 567
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs