ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning Paper • 2505.15776 • Published May 21 • 10
World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning Paper • 2503.10480 • Published Mar 13 • 54
Evaluating Audio Reasoning with Big Bench Audio Article • By mhillsmith and 1 other • Dec 20, 2024 • 21