mistralai/Voxtral-Small-24B-2507

Audio-Text-to-Text
Safetensors
vllm
voxtral
Community (20)

Why is consolidated.safetensors required?

3
#20 opened 11 days ago by Hansen-Wu

Quantised Version

🔥 1
1
#19 opened 15 days ago by steee

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:3!

#17 opened 18 days ago by hoseongahn

ValueError: There is no module or parameter named 'mm_whisper_embeddings' in LlamaForCausalLM

#15 opened 20 days ago by dingyuansheng

Update README.md

#14 opened 21 days ago by arafapollo

correct typo in apply_transcription_request

#13 opened 21 days ago by revvvxx

Really appreciate the work you put into this.🤍

🔥 3
#12 opened 23 days ago by deep-div

Add support to llama.cpp

👍 3
#11 opened 24 days ago by wraps

Improve model card: Update library, add paper link, abstract summary, and refine tags

#10 opened 24 days ago by nielsr

Does this model support streaming ASR recognition, or are there any plans to open-source a streaming model?

4
#8 opened 27 days ago by Qoboty

Large audio files

1
#7 opened 28 days ago by nherve