view article Article Welcome EmbeddingGemma, Google's new efficient embedding model By tomaarsen and 5 others • 5 days ago • 171
view article Article Introducing ColQwen-Omni: Retrieve in every modality By manu and 4 others • Jul 17 • 69
view article Article Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders By orionweller and 5 others • Jul 16 • 67
view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model By EuroBERT and 3 others • Mar 10 • 146
Rank1: Test-Time Compute for Reranking in Information Retrieval Paper • 2502.18418 • Published Feb 25 • 28
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated Jul 21 • 535
view article Article Finally, a Replacement for BERT: Introducing ModernBERT By bclavie and 14 others • Dec 19, 2024 • 683
ModernGLiNER Collection GLiNER models based on modern encoder architectures • 2 items • Updated Dec 24, 2024 • 7
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published Dec 18, 2024 • 155
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 150
Reducing the Footprint of Multi-Vector Retrieval with Minimal Performance Impact via Token Pooling Paper • 2409.14683 • Published Sep 23, 2024 • 12
Three Bricks to Consolidate Watermarks for Large Language Models Paper • 2308.00113 • Published Jul 26, 2023 • 14