danielheinz/e5-base-sts-en-de

danielheinz committed on Jan 14, 2024

Commit e9bd328 · verified · 1 Parent(s): f23cbcf

Update README.md

Files changed (1)

README.md +6 -0

@@ -18,6 +18,8 @@ model-index:
         - type: spearmanr
           value: 0.904
 ---
+**INFO**: The model is being continuously updated.
 The model is a [multilingual-e5-base](https://huggingface.co/intfloat/multilingual-e5-base) model fine-tuned with the task of semantic textual similarity in mind.
 ## Model Training
@@ -26,6 +28,10 @@ The model has been fine-tuned on the German subsets of the following datasets:
 - [paws-x](https://huggingface.co/datasets/paws-x)
 - [stsb_multi_mt](https://huggingface.co/datasets/stsb_multi_mt)
+The training procedure can be divided into two stages:
+- training on paraphrase corpora with the Multiple Negatives Ranking Loss
+- training on semantic textual similarity using the Cosine Similarity Loss
 # Results
 The model achieves the following results:
 - 0.920 on stsb's validation subset
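
The README names the two losses used in training but gives no formulas or hyperparameters. The following is a minimal pure-Python sketch of how these two objectives are commonly defined, not the author's training code. The function names, the toy embeddings, and the `scale` value are assumptions made for illustration. Stage one treats the other positives in a batch as negatives and applies cross-entropy over scaled cosine similarities. Stage two regresses the cosine similarity of a sentence pair onto its gold similarity score.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def multiple_negatives_ranking_loss(anchors, positives, scale=20.0):
    """Stage 1 objective (sketch): positives[i] is the paraphrase of anchors[i];
    every other positive in the batch acts as a negative.
    `scale` is an assumed temperature, not a value taken from the README."""
    total = 0.0
    for i, anchor in enumerate(anchors):
        logits = [scale * cosine(anchor, p) for p in positives]
        # Numerically stable log-sum-exp over all candidates.
        m = max(logits)
        log_sum_exp = m + math.log(sum(math.exp(x - m) for x in logits))
        # Cross-entropy with the matching positive as the target class.
        total += log_sum_exp - logits[i]
    return total / len(anchors)


def cosine_similarity_loss(emb_a, emb_b, gold_scores):
    """Stage 2 objective (sketch): mean squared error between the cosine
    similarity of each pair and its gold similarity score."""
    errors = [(cosine(a, b) - y) ** 2 for a, b, y in zip(emb_a, emb_b, gold_scores)]
    return sum(errors) / len(errors)


if __name__ == "__main__":
    # Toy 2-d embeddings (hypothetical, for illustration only).
    anchors = [[1.0, 0.0], [0.0, 1.0]]
    positives = [[1.0, 0.0], [0.0, 1.0]]
    print(multiple_negatives_ranking_loss(anchors, positives))
    print(cosine_similarity_loss(anchors, positives, [1.0, 1.0]))
```

With correctly aligned pairs the ranking loss is close to zero, and it grows when an anchor is more similar to another item's positive than to its own. The similarity loss is zero only when the cosine similarity equals the gold score exactly.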