Automatic Speech Recognition
Transformers
Safetensors
whisper

This repository is using dedicated language code for different language (whisper's default behavior), which is a different to JackyHoCL/whisper-large-v3-turbo-cantonese-yue-english

2025-10-10: CER:

Dataset Evaluation Lang Dataset Lang Split CER(in %)
Training Mixed Mixed validation 5.839
mozilla-foundation/common_voice_22_0 yue yue test 2.383
mozilla-foundation/common_voice_17_0 yue yue test 2.134
mozilla-foundation/common_voice_17_0 en en test(2k samples) 5.68
mozilla-foundation/common_voice_16_1 zh zh-CN test 6.99
JackyHoCL/cleaned_mixed_cantonese_and_english_speech yue yue test 5.6
Downloads last month
79
Safetensors
Model size
0.8B params
Tensor type
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for JackyHoCL/whisper-large-v3-turbo-cantonese

Finetuned
(373)
this model

Datasets used to train JackyHoCL/whisper-large-v3-turbo-cantonese