How to fix "TypeError: expected str, bytes or os.PathLike object, not NoneType" when specifying the local whisper model
2
#65 opened over 1 year ago
by
BenjaminChu
Speaker Embedding
3
2
#64 opened over 1 year ago
by
bertrand-fournel
whisper jax diarization Icelandic
#62 opened over 1 year ago
by
Dondada79
Translating English Audio Into Spanish Text
4
#61 opened over 1 year ago
by
stvnchnsn

Error with word level timestamps - ValueError: set return_segments=True
5
#60 opened over 1 year ago
by
dkincaid

Passing parameters to the model deployed on HF Inference Endpoints
3
#59 opened over 1 year ago
by
dkincaid

Does whisper-large-v3 work on Sagemaker?
3
#58 opened over 1 year ago
by
dkincaid

Which File Shall I Download from The Files and Versions
2
#57 opened over 1 year ago
by
BenjaminChu
Transcribing multiple languages in single audio file
3
#56 opened over 1 year ago
by
supercharge19
Is there any way we can get no_speech_probability from the pipeline?
1
1
#55 opened over 1 year ago
by
rizwanishaq
how to handle input audio files with either white noise or general noise and no speech
2
#54 opened over 1 year ago
by
unk1911
changed use_flash_attention_2=True to attn_implementation="flash_attention_2"
1
#53 opened over 1 year ago
by
macadeliccc

Whisper parameters?
1
1
#52 opened over 1 year ago
by
Megatron17
Whisper large v3 can not recognize speech after Fine-Tune
5
#51 opened over 1 year ago
by
bardenthenry
how download large version?
1
#50 opened over 1 year ago
by
Timnorth
I've noticed that Uyghur language is not available. Is it possible to add the Uyghur (ug) dataset from the Mozilla Foundation's Common Voice 13.0 for training?
2
#48 opened over 1 year ago
by
almjanx
is parallel processing possible with DLC Deployement?
#47 opened over 1 year ago
by
SharatChandra
Error: cuDNN error: CUDNN_STATUS_NOT_INITIALIZED
#46 opened over 1 year ago
by
duzm
Update README.md
1
#44 opened over 1 year ago
by
Mucahit01
Training for translation
1
13
#43 opened over 1 year ago
by
tirtohadi
Whisper-Large-V3 does not work with explicit use of dtype which is given in config.json
2
#42 opened over 1 year ago
by
ait-paca
Update README.md
#41 opened over 1 year ago
by
joey1895
What azure compute instance will work?
#40 opened over 1 year ago
by
Jayantabasumatary1
Getting pipeline time estimate
1
1
#39 opened over 1 year ago
by
wwilcz
how to evaluate use model-00001-of-00002.safetensors and model-00002-of-00002.safetensors
5
#36 opened over 1 year ago
by
andyweiqiu
Does anyone know how to tag the speaker with Whisper?
4
#35 opened over 1 year ago
by
Shaunnnnn
List of Supported Input Parameters?
2
1
#34 opened over 1 year ago
by
souvik0306

Understands Finnish well
1
#33 opened over 1 year ago
by
Ukuli

whisper outputs non stop no no no
3
#32 opened over 1 year ago
by
rizwanishaq
Word timestamps and "return_language" at the same time
4
#31 opened over 1 year ago
by
Oscaarjs

Batched files with different languages
#30 opened over 1 year ago
by
Oscaarjs

Wrong Suffix after Downloading
1
#29 opened over 1 year ago
by
seriphus
Add TF weights
1
#28 opened over 1 year ago
by
vlad-skripniuk
Setting whisper-specific options in pipeline
4
1
#26 opened over 1 year ago
by
MichalQU
no_speech_prob in pipeline?
3
#22 opened over 1 year ago
by
rizwanishaq
language='cantonese' error
6
#21 opened over 1 year ago
by
Boxp
Speculative Decoding Snippet Not Working
4
#20 opened over 1 year ago
by
nateraw

Hallucination / repetition
2
#19 opened over 1 year ago
by
persundstrom
can i use this free ?
7
#18 opened over 1 year ago
by
AamirFarooq

How to get the language in default mode
1
#15 opened over 1 year ago
by
nastasev
Error with word level timestamps
7
#12 opened over 1 year ago
by
souvik0306

How to get Word and Verbose level transcription?
3
#11 opened over 1 year ago
by
souvik0306

Translate using the API
3
2
#10 opened over 1 year ago
by
SharatChandra
Error when running a model inference
3
5
#8 opened over 1 year ago
by
agonben23
TokenizerFast with tokenizer.json
2
#7 opened over 1 year ago
by
vubiqus
Why float16? Why not Float32
3
#4 opened over 1 year ago
by
ctranslate2-4you
Currently working on finetuning dataset for japanese anime speech
2
2
#3 opened over 1 year ago
by
joujiboi
