MS-DETR: Natural Language Video Localization with Sampling Moment-Moment Interaction Paper β’ 2305.18969 β’ Published May 30, 2023
Parameter-Efficient Conversational Recommender System as a Language Processing Task Paper β’ 2401.14194 β’ Published Jan 25, 2024
MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models Paper β’ 2406.13975 β’ Published Jun 20, 2024
CoIR: A Comprehensive Benchmark for Code Information Retrieval Models Paper β’ 2407.02883 β’ Published Jul 3, 2024
FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving Paper β’ 2502.20238 β’ Published Feb 27 β’ 24
Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations Paper β’ 2504.13816 β’ Published Apr 18 β’ 17
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning Paper β’ 2506.07044 β’ Published 3 days ago β’ 91
Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering Paper β’ 2505.23604 β’ Published 13 days ago β’ 24
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning Paper β’ 2506.07044 β’ Published 3 days ago β’ 91
Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations Paper β’ 2504.13816 β’ Published Apr 18 β’ 17
FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving Paper β’ 2502.20238 β’ Published Feb 27 β’ 24