Article Multimodal Embedding & Reranker Models with Sentence Transformers 1 day ago • 26
Beyond Hard Negatives: The Importance of Score Distribution in Knowledge Distillation for Dense Retrieval Paper • 2604.04734 • Published 4 days ago • 5
Article BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders 2 days ago • 22
BidirLM: From Text to Omnimodal Bidirectional Encoders by Adapting and Composing Causal LLMs Paper • 2604.02045 • Published 8 days ago • 26
ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models Paper • 2602.16609 • Published Feb 18 • 6
LateOn-Code 💻 Collection State-of-the-art late interaction code retrieval models • 6 items • Updated 3 days ago • 17
Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling Feb 12 • 53
Article Community Evals: Because we're done trusting black-box leaderboards over the community Feb 4 • 88
Article Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model Feb 4 • 28
T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground Paper • 2512.10430 • Published Dec 11, 2025 • 119
NanoBEIR datasets Collection These datasets are compatible with the (Sparse)NanoBEIREvaluator with Sentence Transformers v5.2+, and with the CrossEncoderNanoBEIREvaluator if a bm25 column is present • 16 items • Updated Mar 2 • 15
Article ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases Nov 5, 2025 • 63