CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models Paper • 2502.16614 • Published Feb 23 • 27
FutureTOD: Teaching Future Knowledge to Pre-trained Language Model for Task-Oriented Dialogue Paper • 2306.10315 • Published Jun 17, 2023 • 1
DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning Paper • 2402.09136 • Published Feb 14, 2024 • 1
PreAct: Predicting Future in ReAct Enhances Agent's Planning Ability Paper • 2402.11534 • Published Feb 18, 2024 • 1
CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery Paper • 2406.08587 • Published Jun 12, 2024 • 16
How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data Paper • 2409.03810 • Published Sep 5, 2024 • 36