DyCodeEval (ICML 2025) enables dynamic benchmarking for code LLMs. This collection features dynamic HumanEval and MBPP sets generated with Claude 3.5.
Simin Chen
CM
AI & ML interests
None yet
Recent Activity
updated
a dataset
3 days ago
CM/humaneval_trans_java_python
published
a dataset
3 days ago
CM/humaneval_trans_java_python
upvoted
a
collection
21 days ago
DyCodeEval