Article to deepen my understanding and reference it in my work
Hello, does anyone know in which article this model was introduced ? I would like to reference it and read it so I can undestand it more precisely.
Hello!
I think this model was actually not included in a paper, although I may be wrong. There are some details for the related models here: https://www.sbert.net/docs/pretrained-models/msmarco-v5.html (this docs page is not usually shown). I'm pretty sure this model was trained using BinaryCrossEntropyLoss.
On CrossEncoder models overall, you can learn more here: https://www.sbert.net/docs/cross_encoder/usage/usage.html
- Tom Aarsen
Okey, thanks a lot for your help !
Hello!
I think this model was actually not included in a paper, although I may be wrong. There are some details for the related models here: https://www.sbert.net/docs/pretrained-models/msmarco-v5.html (this docs page is not usually shown). I'm pretty sure this model was trained using BinaryCrossEntropyLoss.
On CrossEncoder models overall, you can learn more here: https://www.sbert.net/docs/cross_encoder/usage/usage.html
- Tom Aarsen