kyutai/stt-2.6b-en
Tags: Automatic Speech Recognition, Moshi, Safetensors, English, stt, audio
arxiv: 2410.00037
License: cc-by-4.0
3 contributors
History: 9 commits
Latest commit: Create README.md (af4f55b, verified), by lmz, 3 days ago
File                           Size     Storage  Last commit message                                   Last updated
.gitattributes                 1.52 kB  -        initial commit                                        16 days ago
README.md                      3.56 kB  -        Create README.md                                      3 days ago
config.json                    1.26 kB  -        generation params (delay, prefix) in the config.json  3 days ago
[email protected]              385 MB   LFS      rename Mimi model (why do we use @ in the name)       16 days ago
model.safetensors              5.23 GB  LFS      Upload 2 files                                        9 days ago
tokenizer_en_audio_4000.model  59.3 kB  LFS      Initial model upload                                  16 days ago