Please add State of the Art proprietary models to OpenASR leaderboard

#37
by Buttermilk03 - opened

Add Gemini 2.5 Pro
According to the release paper Gemini 2.5 pro could be SOTA
(https://huggingface.co/google)

Add Soniox
Please add Soniox https://soniox.com/

Documentation for Async Transcription
https://soniox.com/docs/speech-to-text/get-started/transcribe-file

Documentation for realtime transcription
https://soniox.com/docs/speech-to-text/get-started/transcribe-realtime
https://huggingface.co/soniox

Add Amazon Nova Sonic
Please add Amazon Nova Sonic https://aws.amazon.com/de/ai/generative-ai/nova/speech/

API Documentation
https://docs.aws.amazon.com/nova/latest/userguide/speech.html

Add OpenAI gpt4o-transcribe
Please add OpenAI Gpt4o-transcribe https://openai.com/index/introducing-our-next-generation-audio-models/

Here is the API Documentation
https://platform.openai.com/docs/api-reference/audio/createTranscription

Add Wizper V3 from fal.ai
Please add https://fal.ai/models/fal-ai/wizper to the evaluation

API Documentation:
https://fal.ai/models/fal-ai/wizper/api

Add cartesia-ink
Please add https://cartesia.ai/ink to the evaluation

API doc
https://docs.cartesia.ai/2025-04-16/build-with-cartesia/models/stt#ink-whisper

Buttermilk03 changed discussion title from Please add State of the Art proprietary models to Please add State of the Art proprietary models to OpenASR bechmark
Buttermilk03 changed discussion title from Please add State of the Art proprietary models to OpenASR bechmark to Please add State of the Art proprietary models to OpenASR leaderboard

@Steveeeeeeen : What do you think, do you have plans to add more models?

There are just many lacking models. Most benchmarks are not independent. OpenASR is a relief here. I found one more benchmark here, https://voicewriter.io/speech-recognition-leaderboard , they also test the Gemini models where they are in the Top 3. The benchmark however is not as comprehensive as the OpenASR.

Hugging Face for Audio org

Hi @Buttermilk03 !
Thanks for the suggestion. For proprietary models, we usually wait for the developers to reach out and grant us credits to run their models. If that’s not the case, then we typically won’t add them.

Steveeeeeeen changed discussion status to closed

Soniox has 200 usd free credits anyway. This should be easily enough for the tests.
https://soniox.com/pricing

Gemini has a free tier as well with lower rate limits:
https://ai.google.dev/gemini-api/docs/pricing

Sign up or log in to comment