- Let LLMs Break Free from Overthinking via Self-Braking Tuning
  Paper • 2505.14604 • Published • 23
- AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios
  Paper • 2505.16944 • Published • 8
- Training Step-Level Reasoning Verifiers with Formal Verification Tools
  Paper • 2505.15960 • Published • 7
- The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning
  Paper • 2505.15134 • Published • 6
Felix Tuma
floom
AI & ML interests
NLP
Recent Activity
Updated a collection about 20 hours ago: PotentialApplication
Upvoted a paper about 20 hours ago: Prompt Orchestration Markup Language
Upvoted a paper about 20 hours ago: Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL
Organizations
None yet