LLM-Independent Adaptive RAG: Let the Question Speak for Itself Paper • 2505.04253 • Published 3 days ago • 8
OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning Paper • 2505.04601 • Published 2 days ago • 14
OmniGIRL: A Multilingual and Multimodal Benchmark for GitHub Issue Resolution Paper • 2505.04606 • Published 2 days ago • 6
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Paper • 2505.02567 • Published 5 days ago • 61
ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published 2 days ago • 45
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play Paper • 2505.02707 • Published 4 days ago • 77
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published 4 days ago • 91
UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities Paper • 2504.20734 • Published 11 days ago • 61
Even Small Reasoners Should Quote Their Sources: Introducing the Pleias-RAG Model Family Paper • 2504.18225 • Published 15 days ago • 12
Running on Zero 686 686 MMAudio — generating synchronized audio from video/text 🔊 Generate audio from video or text prompts