Update README.md
Browse files
README.md
CHANGED
@@ -88,7 +88,7 @@ It was trained using HuggingFace Transformers and Pytorch, using the [Causal Mod
|
|
88 |
### Training data
|
89 |
|
90 |
|
91 |
-
The training corpus consists of texts in 4 languages, with an emphasis on Portuguese
|
92 |
|
93 |
The corpus is composed as follows:
|
94 |
|
|
|
88 |
### Training data
|
89 |
|
90 |
|
91 |
+
The training corpus consists of texts in 4 languages, with an emphasis on Portuguese. The main aim of this is to ensure that the model learns to work with this language perfectly, while maintaining knowledge of languages already known (Spanish, English), learning others (Galician) or adapting existing language varieties (Portuguese-PT instead of Portuguese-BR).
|
92 |
|
93 |
The corpus is composed as follows:
|
94 |
|