VibeVoice Collection Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 5 items • Updated 6 days ago • 106
Article 🥬 TinyLettuce: Efficient Hallucination Detection with 17–68M Encoders By adaamko and 1 other • 7 days ago • 9
jina-code-embeddings Collection high quality code embeddings trained from code generation models • 5 items • Updated 3 days ago • 11
Article Welcome EmbeddingGemma, Google's new efficient embedding model By tomaarsen and 5 others • 3 days ago • 145
On the Theoretical Limitations of Embedding-Based Retrieval Paper • 2508.21038 • Published 10 days ago • 15
Article Turk-LettuceDetect: A Hallucination Detection Models for Turkish RAG Applications By nmmursit and 5 others • 9 days ago • 26
Article SAIR: Accelerating Pharma R&D with AI-Powered Structural Intelligence By SandboxAQ and 3 others • 5 days ago • 29
Article Make your ZeroGPU Spaces go brrr with PyTorch ahead-of-time compilation By cbensimon and 3 others • 5 days ago • 39
Contextual AI Reranker v2 Collection Family of instruction-following multilingual rerankers on the cost/performance Pareto frontier across public and customer benchmarks • 6 items • Updated 12 days ago • 8
Article NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks By nvidia and 4 others • 27 days ago • 72
SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens Paper • 2508.05305 • Published about 1 month ago • 45
Article Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning By codelion • 29 days ago • 12
ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability Paper • 2508.07050 • Published 29 days ago • 114
SitEmb-v1.5: Improved Context-Aware Dense Retrieval for Semantic Association and Long Story Comprehension Paper • 2508.01959 • Published Aug 3 • 57