Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
maitrix-org
/
Voila-base
like
10
Follow
Maitrix.org
57
Audio-to-Audio
Transformers
Safetensors
maitrix-org/Voila-Benchmark
maitrix-org/Voila-million-voice
6 languages
llama
audio
audio-language-model
speech-recognition
text-to-speech
speech-conversation
text-generation-inference
arxiv:
2505.02707
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Voila-base
/
README.md
Commit History
Update README.md
8604b7a
verified
shiyemin2
commited on
10 days ago
Update README.md
065d5bd
verified
zhitinghu
commited on
10 days ago
Update README.md
dac7b5d
verified
zhitinghu
commited on
10 days ago
Update GitHub Pages URLs in README.md
6b53a81
verified
wuqing157
commited on
11 days ago
Update README.md
ada60c2
verified
shiyemin2
commited on
13 days ago
Update README.md
cf466d6
verified
shiyemin2
commited on
19 days ago
Update README.md
0947cf2
verified
shiyemin2
commited on
Mar 24
Upload tokenizer
b8d5143
verified
shiyemin2
commited on
Mar 18