olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated 13 days ago • 91
view article Article Agentic RAG Stack (2/5) - Augment retrieval results by reranking using Sentence Transformers By davidberenstein1957 • Feb 5 • 9
view article Article Agentic RAG Stack (1/5) - Index and retrieve documents for vector search using Sentence Transformers and DuckDB By davidberenstein1957 • Jan 27 • 18
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published Dec 4, 2024 • 129
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval Paper • 2407.19669 • Published Jul 29, 2024 • 23
view article Article Experimenting with Automatic PII Detection on the Hub using Presidio Jul 10, 2024 • 24
view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models Jun 24, 2024 • 189
view article Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡ By xhluca • Jul 9, 2024 • 43
YOLOv10 Collection This collection hosts the YOLOv10 model releases • 16 items • Updated Jun 3, 2024 • 18
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval Paper • 2401.18059 • Published Jan 31, 2024 • 39