LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model • Paper • 2509.00676 • Published 9 days ago • 76
Jointly Reinforcing Diversity and Quality in Language Model Generations • Paper • 2509.02534 • Published 6 days ago • 24
Hallucination Score: Towards Mitigating Hallucinations in Generative Image Super-Resolution • Paper • 2507.14367 • Published Jul 18
Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering • Paper • 2305.17080 • Published May 26, 2023
Text Quality-Based Pruning for Efficient Training of Language Models • Paper • 2405.01582 • Published Apr 26, 2024
Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation • Paper • 2507.08441 • Published Jul 11 • 61
The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements • Paper • 2506.22419 • Published Jun 27 • 14
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning • Paper • 2506.24119 • Published Jun 30 • 49
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning • Paper • 2506.09985 • Published Jun 11 • 30