Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth Paper β’ 2509.03867 β’ Published 4 days ago β’ 178
Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions? Paper β’ 2509.04292 β’ Published 3 days ago β’ 48
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning Paper β’ 2509.02544 β’ Published 5 days ago β’ 105
Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions? Paper β’ 2509.04292 β’ Published 3 days ago β’ 48
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning Paper β’ 2509.02479 β’ Published 5 days ago β’ 76
TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling Paper β’ 2508.17445 β’ Published 14 days ago β’ 78
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning Paper β’ 2509.02544 β’ Published 5 days ago β’ 105