Safetensors
t5
Commit 50cad5e (verified) by robinq · 1 parent: d61dca7

Update README.md

Files changed (1)
  1. README.md +19 -3
README.md CHANGED
@@ -1,3 +1,19 @@
- ---
- license: apache-2.0
- ---
+ ---
+ license: apache-2.0
+ ---
+
+ # Swedish OCR Correction
+
+ This model is an updated version of [viklofg/swedish-ocr-correction](https://huggingface.co/viklofg/swedish-ocr-correction).
+
+ The model has been trained to correct OCR output from ABBYY, Tesseract, and a combination of the two on newspapers from 1818 to 2018 (see [A Two-OCR Engine Method for Digitized Swedish Newspapers](https://ecp.ep.liu.se/index.php/clarin/article/view/8)).
+
+ Please check the [original model](https://huggingface.co/viklofg/swedish-ocr-correction) for more information.
+
+ This new model was trained for much longer and outperforms the previous one on the same train-test split, with a lower character error rate (CER) and word error rate (WER).
+
+ | Model | CER | WER |
+ | - | - | - |
+ | Original OCR | 0.030104242197988382 | 0.13228245334217123 |
+ | viklofg | 0.019238745627081718 | 0.07410216243545432 |
+ | KBLab | 0.015691332569226026 | 0.06230781344539208 |
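
For readers unfamiliar with the two metrics in the table, the sketch below shows the textbook definitions: CER is the character-level edit distance divided by the reference length, and WER is the same ratio over whitespace-separated words. The README does not say how its scores were computed (text normalization, tokenization, or how results were aggregated over the test set), so this is an assumption about the general approach, not the authors' evaluation script. The function names and the sample strings are made up for illustration.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists of tokens)."""
    prev = list(range(len(hyp) + 1))  # distance from empty ref prefix to each hyp prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty hyp prefix
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete r from the reference
                curr[j - 1] + 1,     # insert h into the reference
                prev[j - 1] + cost,  # match or substitute
            ))
        prev = curr
    return prev[-1]


def cer(reference, hypothesis):
    """Character error rate: character edits divided by reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


def wer(reference, hypothesis):
    """Word error rate: word edits divided by the number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def relative_reduction(before, after):
    """Fraction by which an error rate dropped, e.g. between two table rows."""
    return (before - after) / before


# Illustrative strings only (not taken from the model's test set).
reference = "den svenska tidningen"
ocr_output = "den svenska tidningcn"  # one wrong character in the last word

print(f"CER: {cer(reference, ocr_output):.4f}")
print(f"WER: {wer(reference, ocr_output):.4f}")

# CER values copied from the table above (viklofg row vs. KBLab row).
print(f"Relative CER reduction: {relative_reduction(0.019238745627081718, 0.015691332569226026):.3f}")
```

A single wrong character costs one full word in WER but only one character in CER, which is why the WER column in the table is several times larger than the CER column for every row.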