FAT5 (Flash Attention T5) report ⚡ English version of the blog post introducing the FAT5 model
hatemestinbejaia/mMiniLML-bi-encoder-KD-v1-0Student_TripletLossAdptativeMargin-1Teacher_marginloss-m15 Sentence Similarity • Updated 6 days ago • 9 • 1
albertus-sussex/veriscrape-book-test-sbert-bs128_lr5e-05_ep3_euclidean_snTrue_spFalse_hn1 Sentence Similarity • Updated 7 days ago • 9 • 1