Speech Recognition Models
Collection
Models for Welsh language and bilingual speech recognition
•
15 items
•
Updated
Model Name: whisper-large-v3-ft-verbatim-cy-en-ct2
Model Description:
This model card describes whisper-large-v3-ft-verbatim-cy-en-ct2
, a conversion of the techiaith/whisper-large-v3-ft-verbatim-cy-en
fine-tuned OpenAI whisper model to the CTranslate2 format. This conversion allows for significantly faster and more efficient
inference, particularly on CPU and with batching.
How to Use:
from faster_whisper import WhisperModel
audio_file_path=<path to your audio file>
model = WhisperModel("techiaith/whisper-large-v3-ft-verbatim-cy-en-ct2")
segments, info = model.transcribe(audio_file_path, beam_size=5)
print("Detected language '%s' with probability %f" % (info.language, info.language_probability))
for segment in segments:
print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))
Detected language 'cy' with probability 0.999987
[0.00s -> 4.24s] Dwi teimlo weithie unwaith ti'n cyfadda bo' na rwbath yn bod ma'n wir wedyn dydi?
Base model
openai/whisper-large-v3