LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs
Abstract
We introduce LeetCodeDataset, a high-quality benchmark for evaluating and training code-generation models, addressing two key challenges in LLM research: the lack of reasoning-focused coding benchmarks and self-contained training testbeds. By curating LeetCode Python problems with rich metadata, broad coverage, 100+ test cases per problem, and temporal splits (pre/post July 2024), our dataset enables contamination-free evaluation and efficient supervised fine-tuning (SFT). Experiments show reasoning models significantly outperform non-reasoning counterparts, while SFT with only 2.6K model-generated solutions achieves performance comparable to 110K-sample counterparts. The dataset and evaluation framework are available on Hugging Face and Github.
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding (2025)
- ProBench: Benchmarking Large Language Models in Competitive Programming (2025)
- OpenCodeInstruct: A Large-scale Instruction Tuning Dataset for Code LLMs (2025)
- OpenCodeReasoning: Advancing Data Distillation for Competitive Coding (2025)
- IterPref: Focal Preference Learning for Code Generation via Iterative Debugging (2025)
- CodeArena: A Collective Evaluation Platform for LLM Code Generation (2025)
- RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper