Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance Paper • 2505.16348 • Published May 22 • 53
One Missing Piece for Open-Source Reasoning Models: A Dataset to Mitigate Cold-Starting Short CoT LLMs in RL Paper • 2506.02338 • Published Jun 3 • 4
Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance Paper • 2505.16348 • Published May 22 • 53
Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents Paper • 2310.09343 • Published Oct 13, 2023 • 2
Coffee: Boost Your Code LLMs by Fixing Bugs with Feedback Paper • 2311.07215 • Published Nov 13, 2023 • 3
Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models Paper • 2404.02575 • Published Apr 3, 2024 • 51
Can Large Language Models be Good Emotional Supporter? Mitigating Preference Bias on Emotional Support Conversation Paper • 2402.13211 • Published Feb 20, 2024
Evaluating Robustness of Reward Models for Mathematical Reasoning Paper • 2410.01729 • Published Oct 2, 2024
Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance Paper • 2505.16348 • Published May 22 • 53
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models Paper • 2406.05761 • Published Jun 9, 2024 • 3
Evaluating Robustness of Reward Models for Mathematical Reasoning Paper • 2410.01729 • Published Oct 2, 2024
Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics Paper • 2406.14703 • Published Jun 20, 2024 • 2
Imagine All The Relevance: Scenario-Profiled Indexing with Knowledge Expansion for Dense Retrieval Paper • 2503.23033 • Published Mar 29 • 2