Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model By EuroBERT and 3 others • 2 days ago • 112
Article Refreshing zero-shot classification with ModernBERT By Ihor • 15 days ago • 1
Article Agentic RAG Stack (3/5) - Generate responses using a SmolLM By davidberenstein1957 • Feb 6 • 6
Article Agentic RAG Stack (1/5) - Index and retrieve documents for vector search using Sentence Transformers and DuckDB By davidberenstein1957 • Jan 27 • 18
Article Agentic RAG Stack (2/5) - Augment retrieval results by reranking using Sentence Transformers By davidberenstein1957 • Feb 5 • 9
Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • Jan 23 • 64
Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 158
Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 138
GLiREL -- Generalist Model for Zero-Shot Relation Extraction Paper • 2501.03172 • Published Jan 6 • 1
Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP Paper • 2408.04303 • Published Aug 8, 2024 • 20
Granite 3.1 Language Models Collection A series of language models with 128K context length, trained by IBM and licensed under Apache 2.0. • 9 items • Updated 16 days ago • 59
Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡ By xhluca • Jul 9, 2024 • 43
Positions Datasets Collection Datasets where each row is a chess position • 4 items • Updated Jan 9 • 7
Common Models Collection The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 30
Tucano Collection Tucano is a series of decoder-transformers based on the Llama 2 architecture, natively pre-trained in Portuguese. • 17 items • Updated Nov 13, 2024 • 2
Common Corpus Collection Largest multilingual pretraining data. • 1 item • Updated Nov 13, 2024 • 9