NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation Paper • 2504.13055 • Published 10 days ago • 18
Thinking Longer, Not Larger: Enhancing Software Engineering Agents via Scaling Test-Time Compute Paper • 2503.23803 • Published 27 days ago • 8
Efficient Inference for Large Reasoning Models: A Survey Paper • 2503.23077 • Published 29 days ago • 46
Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation Paper • 2503.19622 • Published Mar 25 • 30
GuardReasoner Collection As LLMs increasingly impact safety-critical applications, ensuring their safety using guardrails remains a key challenge. This paper proposes GuardRea • 5 items • Updated Feb 8 • 1
Less is More: Fewer Interpretable Region via Submodular Subset Selection Paper • 2402.09164 • Published Feb 14, 2024 • 2
Interpreting Object-level Foundation Models via Visual Precision Search Paper • 2411.16198 • Published Nov 25, 2024 • 2
At Which Training Stage Does Code Data Help LLMs Reasoning? Paper • 2309.16298 • Published Sep 28, 2023 • 1
MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures Paper • 2410.13754 • Published Oct 17, 2024 • 76