- Let LLMs Break Free from Overthinking via Self-Braking Tuning
  Paper • 2505.14604 • Published • 23
- AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios
  Paper • 2505.16944 • Published • 8
- Training Step-Level Reasoning Verifiers with Formal Verification Tools
  Paper • 2505.15960 • Published • 7
- The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning
  Paper • 2505.15134 • Published • 6
Felix Tuma
floom
AI & ML interests
NLP
Recent Activity
upvoted a paper about 23 hours ago
Deep Researcher with Test-Time Diffusion
upvoted a paper 4 days ago
Group Sequence Policy Optimization
upvoted a paper 6 days ago
Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning
Organizations
None yet