NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation Paper • 2504.13055 • Published 10 days ago • 18
Thinking Longer, Not Larger: Enhancing Software Engineering Agents via Scaling Test-Time Compute Paper • 2503.23803 • Published 27 days ago • 8
Efficient Inference for Large Reasoning Models: A Survey Paper • 2503.23077 • Published 29 days ago • 46
Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation Paper • 2503.19622 • Published Mar 25 • 30
GuardReasoner Collection As LLMs increasingly impact safety-critical applications, ensuring their safety using guardrails remains a key challenge. This paper proposes GuardRea • 5 items • Updated Feb 8 • 1