reazonspeech-espnet-v1
reazonspeech-espnet-v1
is an ESPnet model trained for Japanese automatic speech recognition (ASR).
- This model was trained on 15,000 hours of ReazonSpeech corpus.
- Make sure that your audio file is sampled at 16khz when using this model.
For more details, please visit the official project page.
- Downloads last month
- 20
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.