Update README.md
README.md CHANGED
@@ -35,9 +35,26 @@ No preference tuning has been applied to this revision of the model.
- **Repository:** https://ultravox.ai
- **Demo:** See repo

-##
+## Usage

-
+Think of the model as an LLM that can also hear and understand speech. As such, it can be used as a voice agent, and also to do speech-to-speech translation, analysis of spoken audio, etc.
+
+To use the model, try the following:
+```python
+# pip install transformers peft librosa
+
+import transformers
+import numpy as np
+import librosa
+
+pipe = transformers.pipeline(model='fixie-ai/ultravox-v0_2', trust_remote_code=True)
+
+path = "<path-to-input-audio>"  # TODO: pass the audio here
+audio, sr = librosa.load(path, sr=16000)
+
+
+pipe({'audio': audio, 'prompt': '<|audio|>', 'sampling_rate': sr}, max_new_tokens=30)
+```


## Training Details
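The snippet added above makes a single generic generation call. As a sketch of the speech-to-speech-translation use the new Usage text mentions, the same call might be varied as below; `pipe`, `audio`, and `sr` are the objects created in the added snippet, while the prompt wording and the placement of the `<|audio|>` placeholder are illustrative assumptions rather than part of the model card.

```python
# Minimal sketch of the speech-translation use case mentioned in the Usage text.
# Assumes `pipe`, `audio`, and `sr` were created exactly as in the snippet above;
# the prompt wording and target language are illustrative assumptions.
translation_prompt = "Translate the following speech into French: <|audio|>"

result = pipe(
    {'audio': audio, 'prompt': translation_prompt, 'sampling_rate': sr},
    max_new_tokens=60,
)
print(result)
```

Swapping a different instruction in ahead of the audio placeholder should cover the "analysis of spoken audio" case in the same way.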