ClaimExtractor-2605 Collection Extract claims and intents from conversations • 6 items • Updated 1 day ago • 7
Verbatim RAG v1 Collection Hallucination free RAG and out SOTA state-of-the-art extractors • 8 items • Updated 4 days ago • 9
Foundation Text-Generation Models Below 360M Parameters Collection Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. • 43 items • Updated 15 days ago • 46
Adaptive Chunking: Optimizing Chunking-Method Selection for RAG Paper • 2603.25333 • Published Mar 26 • 4
(Sparse) Attention to the Details: Preserving Spectral Fidelity in ML-based Weather Forecasting Models Paper • 2604.16429 • Published 25 days ago • 2
DeAR-Reranking Collection DeAR (Deep Agent Rank): Dual-Stage Document Reranking with Reasoning Agents Accepted at EMNLP Findings 2025 • 12 items • Updated Oct 21, 2025 • 2
view article Article OlmoEarth v1.1: A more efficient family of Earth observation models allenai • 18 days ago • 22
view article Article Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality ibm-granite • 23 days ago • 32
Precise Zero-Shot Dense Retrieval without Relevance Labels Paper • 2212.10496 • Published Dec 20, 2022 • 6
BiXSE: Improving Dense Retrieval via Probabilistic Graded Relevance Distillation Paper • 2508.06781 • Published Aug 9, 2025 • 1
view article Article Unlocking asynchronicity in continuous batching +1 ror, pcuenq, ariG23498 • 24 days ago • 58
view article Article SSE Retrieval MRL v2: Regularization of Representation Space and Performance Improvement via Hyperparameter Optimization RikkaBotan • 24 days ago • 2
Rethinking Agentic Search with Pi-Serini: Is Lexical Retrieval Sufficient? Paper • 2605.10848 • Published 27 days ago • 5
A Causal Language Modeling Detour Improves Encoder Continued Pretraining Paper • 2605.12438 • Published 26 days ago • 7