Update README.md
Browse files
README.md
CHANGED
|
@@ -18,6 +18,8 @@ model-index:
|
|
| 18 |
- type: spearmanr
|
| 19 |
value: 0.904
|
| 20 |
---
|
|
|
|
|
|
|
| 21 |
The model is a [multilingual-e5-base](https://huggingface.co/intfloat/multilingual-e5-base) model fine-tuned with the task of semantic textual similarity in mind.
|
| 22 |
|
| 23 |
## Model Training
|
|
@@ -26,6 +28,10 @@ The model has been fine-tuned on the German subsets of the following datasets:
|
|
| 26 |
- [paws-x](https://huggingface.co/datasets/paws-x)
|
| 27 |
- [stsb_multi_mt](https://huggingface.co/datasets/stsb_multi_mt)
|
| 28 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 29 |
# Results
|
| 30 |
The model achieves the following results:
|
| 31 |
- 0.920 on stsb's validation subset
|
|
|
|
| 18 |
- type: spearmanr
|
| 19 |
value: 0.904
|
| 20 |
---
|
| 21 |
+
**INFO**: The model is being continuously updated.
|
| 22 |
+
|
| 23 |
The model is a [multilingual-e5-base](https://huggingface.co/intfloat/multilingual-e5-base) model fine-tuned with the task of semantic textual similarity in mind.
|
| 24 |
|
| 25 |
## Model Training
|
|
|
|
| 28 |
- [paws-x](https://huggingface.co/datasets/paws-x)
|
| 29 |
- [stsb_multi_mt](https://huggingface.co/datasets/stsb_multi_mt)
|
| 30 |
|
| 31 |
+
The training procedure can be divided into two stages:
|
| 32 |
+
- training on paraphrase corpora with the Multiple Negatives Ranking Loss
|
| 33 |
+
- training on semantic textual similarity using theh Cosine Similarity Loss
|
| 34 |
+
|
| 35 |
# Results
|
| 36 |
The model achieves the following results:
|
| 37 |
- 0.920 on stsb's validation subset
|