Romanian Semantic Textual Similarity Cupidon Collection Cupidon for Romanian texts! 💘 These models feel the chemistry between sentence pairs in RO. • 5 items • Updated 2 days ago • 3
NeuralNDCG: Direct Optimisation of a Ranking Metric via Differentiable Relaxation of Sorting Paper • 2102.07831 • Published Feb 15, 2021 • 1
🥗 FoodEx2 System Collection Datasets and Models for the FoodEx2 System Project • 10 items • Updated 6 days ago • 1
AraEuroBERT Collection Ara-EuroBERT is a collection of Arabic Semantic Embeddings built on EuroBERT, delivering adaptive embeddings with ultra-long context. • 5 items • Updated 2 days ago • 2
DistilCamemBERT: a distillation of the French model CamemBERT Paper • 2205.11111 • Published May 23, 2022 • 3
CamemBERT 2.0: A Smarter French Language Model Aged to Perfection Paper • 2411.08868 • Published Nov 13, 2024 • 13
Gemini Embedding: Generalizable Embeddings from Gemini Paper • 2503.07891 • Published 14 days ago • 33
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 13 days ago • 342
Article LeRobot goes to driving school: World’s largest open-source self-driving dataset 14 days ago • 67
It's All in The [MASK]: Simple Instruction-Tuning Enables BERT-like Masked Language Models As Generative Classifiers Paper • 2502.03793 • Published Feb 6 • 4
EuroBERT Collection Scaling Multilingual Encoders for European Languages • 4 items • Updated 14 days ago • 10
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published 17 days ago • 75