ai4privacy/llama-ai4privacy-multilingual-categorical-anonymiser-openpii Token Classification • 0.1B • Updated Mar 24 • 1.3k • 15
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published Mar 7 • 81
Running 8 8 Pre-training Dutch T5 and UL2 Models, evaluation and model lists 🚀 Explore and compare Dutch T5 models for summarization and translation
It's All in The [MASK]: Simple Instruction-Tuning Enables BERT-like Masked Language Models As Generative Classifiers Paper • 2502.03793 • Published Feb 6 • 4
Babel Collection Open Multilingual Large Language Models Serving Over 90% of Global Speakers • 5 items • Updated Apr 15 • 18
ibm-granite/granite-embedding-278m-multilingual Sentence Similarity • 0.3B • Updated Mar 4 • 72.6k • • 45
Rank1: Test-Time Compute for Reranking in Information Retrieval Paper • 2502.18418 • Published Feb 25 • 27
denniscraandijk/dutch-english-snowflake-arctic-embed-l-v2.0 Sentence Similarity • 0.4B • Updated Feb 14 • 5
denniscraandijk/dutch-english-snowflake-arctic-embed-l-v2.0 Sentence Similarity • 0.4B • Updated Feb 14 • 5