JhonVanced commited on
Commit
d280cd4
1 Parent(s): c2345bb

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +141 -0
README.md ADDED
@@ -0,0 +1,141 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ - zh
5
+ - de
6
+ - es
7
+ - ru
8
+ - ko
9
+ - fr
10
+ - ja
11
+ - pt
12
+ - tr
13
+ - pl
14
+ - ca
15
+ - nl
16
+ - ar
17
+ - sv
18
+ - it
19
+ - id
20
+ - hi
21
+ - fi
22
+ - vi
23
+ - he
24
+ - uk
25
+ - el
26
+ - ms
27
+ - cs
28
+ - ro
29
+ - da
30
+ - hu
31
+ - ta
32
+ - 'no'
33
+ - th
34
+ - ur
35
+ - hr
36
+ - bg
37
+ - lt
38
+ - la
39
+ - mi
40
+ - ml
41
+ - cy
42
+ - sk
43
+ - te
44
+ - fa
45
+ - lv
46
+ - bn
47
+ - sr
48
+ - az
49
+ - sl
50
+ - kn
51
+ - et
52
+ - mk
53
+ - br
54
+ - eu
55
+ - is
56
+ - hy
57
+ - ne
58
+ - mn
59
+ - bs
60
+ - kk
61
+ - sq
62
+ - sw
63
+ - gl
64
+ - mr
65
+ - pa
66
+ - si
67
+ - km
68
+ - sn
69
+ - yo
70
+ - so
71
+ - af
72
+ - oc
73
+ - ka
74
+ - be
75
+ - tg
76
+ - sd
77
+ - gu
78
+ - am
79
+ - yi
80
+ - lo
81
+ - uz
82
+ - fo
83
+ - ht
84
+ - ps
85
+ - tk
86
+ - nn
87
+ - mt
88
+ - sa
89
+ - lb
90
+ - my
91
+ - bo
92
+ - tl
93
+ - mg
94
+ - as
95
+ - tt
96
+ - haw
97
+ - ln
98
+ - ha
99
+ - ba
100
+ - jw
101
+ - su
102
+ - yue
103
+ tags:
104
+ - audio
105
+ - automatic-speech-recognition
106
+ license: mit
107
+ library_name: ctranslate2
108
+ ---
109
+ Convert from: sin2piusc/whisper-large-v2-10k
110
+ # Whisper large-v2 model for CTranslate2
111
+
112
+ This repository contains the conversion of [sin2piusc/whisper-large-v2-10k](https://huggingface.co/sin2piusc/whisper-large-v2-10k) to the [CTranslate2](https://github.com/OpenNMT/CTranslate2) model format.
113
+
114
+ This model can be used in CTranslate2 or projects based on CTranslate2 such as [faster-whisper](https://github.com/systran/faster-whisper).
115
+
116
+ ## Example
117
+
118
+ ```python
119
+ from faster_whisper import WhisperModel
120
+
121
+ model = WhisperModel("large-v2")
122
+
123
+ segments, info = model.transcribe("audio.mp3")
124
+ for segment in segments:
125
+ print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))
126
+ ```
127
+
128
+ ## Conversion details
129
+
130
+ The original model was converted with the following command:
131
+
132
+ ```
133
+ ct2-transformers-converter --model sin2piusc/whisper-large-v2-10k --output_dir faster-whisper-large-v3-ja \
134
+ --copy_files tokenizer.json preprocessor_config.json --quantization float16
135
+ ```
136
+
137
+ Note that the model weights are saved in FP16. This type can be changed when the model is loaded using the [`compute_type` option in CTranslate2](https://opennmt.net/CTranslate2/quantization.html).
138
+
139
+ ## More information
140
+
141
+ **For more information about the original model, see its [model card](https://huggingface.co/sin2piusc/whisper-large-v2-10k).**