bezzam's Collections
Multimodel audio
Updated 7 days ago
facebook/seamless-m4t-v2-large • Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 58.4k • 887
stepfun-ai/Step-Audio-2-mini • Any-to-Any • 8B • Updated 3 days ago • 3.02k • 206
bosonai/higgs-audio-v2-generation-3B-base • Text-to-Speech • 6B • Updated Jul 28 • 324k • 595