Ultravox is a multimodal speech LLM that pairs a frozen pretrained LLM (which one varies by release) with a fine-tuned whisper-large-v3-turbo backbone.

Fixie.ai
Organization Card
Fixie.ai: We're building AIs that can communicate as naturally as humans
Human communication is messy. We interrupt, talk over each other, and don't always wait our turn. But this rapid, messy exchange of ideas serves as the backbone of human progress.
LLMs are revolutionary, but their potential impact is currently limited to situations where text-based chat is sufficient.
We think useful, productive, and accessible AGI will require models that can operate in the fast-paced, ambiguous world of human voice communication.
This is the problem we're tackling. If that sounds interesting, check out our SLM (Speech Language Model), Ultravox!
Models (14)

fixie-ai/ultravox-v0_3-llama-3_2-1b • Audio-Text-to-Text • 234k • 6
fixie-ai/ultravox-v0_5-llama-3_3-70b • Audio-Text-to-Text • 4.03k • 9
fixie-ai/ultravox-v0_5-llama-3_2-1b • Audio-Text-to-Text • 36.2k • 8
fixie-ai/ultravox-v0_5-llama-3_1-8b • Audio-Text-to-Text • 2.41k • 8
fixie-ai/ultravox-v0_4-mistral_nemo • 53
fixie-ai/ultravox-v0_2 • Audio-Text-to-Text • 936 • 51
fixie-ai/ultravox-v0_4-llama-3_1-70b • Audio-Text-to-Text • 108 • 5
fixie-ai/ultravox-v0_4-ToolACE-8B • Audio-Text-to-Text • 36 • 1
fixie-ai/ultravox-v0_4_1-llama-3_1-70b • Audio-Text-to-Text • 4.69k • 24
fixie-ai/ultravox-v0_4_1-mistral-nemo • Audio-Text-to-Text • 680 • 24

Datasets (27)

fixie-ai/ultravox-endpointing-02-19-25 • 10
fixie-ai/endpointing • 260k • 394 • 1
fixie-ai/endpointing-audio • 1.65k • 304 • 3
fixie-ai/slue_p2_sqa5_test • 408 • 185
fixie-ai/dream_tts_mcq_test • 1.91k • 202
fixie-ai/public_sg_speech_qa_test • 688 • 183
fixie-ai/cn_college_listen_mcq_test • 2.27k • 225
fixie-ai/big_bench_audio • 1k • 543
fixie-ai/cv-multi-lingual-phrases • 1M • 420 • 1
fixie-ai/common_voice_17_0 • 11.4M • 96.3k • 7