arxiv:2502.13138

AIDE: AI-Driven Exploration in the Space of Code

Published on Feb 18

· Submitted by

dexhunter on Feb 20

Upvote

Authors:

Zhengyao Jiang ,

Dominik Schmidt ,

Dhruv Srikanth ,

Dixing Xu ,

Abstract

Machine learning, the foundation of modern artificial intelligence, has driven innovations that have fundamentally transformed the world. Yet, behind advancements lies a complex and often tedious process requiring labor and compute intensive iteration and experimentation. Engineers and scientists developing machine learning models spend much of their time on trial-and-error tasks instead of conceptualizing innovative solutions or research hypotheses. To address this challenge, we introduce AI-Driven Exploration (AIDE), a machine learning engineering agent powered by large language models (LLMs). AIDE frames machine learning engineering as a code optimization problem, and formulates trial-and-error as a tree search in the space of potential solutions. By strategically reusing and refining promising solutions, AIDE effectively trades computational resources for enhanced performance, achieving state-of-the-art results on multiple machine learning engineering benchmarks, including our Kaggle evaluations, OpenAI MLE-Bench and METRs RE-Bench.

View arXiv page View PDF Add to collection

Community

dexhunter

Paper author Paper submitter 1 day ago

AIDE has stood the test of time as the leading ML engineering agent, showing strong potential to automate data science modeling, deep learning, and AI R&D.

BonaireBear

1 day ago

I am Ian Kaplan, with a Reddit login BonaireBear. I am not a co-author for this paper.

BonaireBear

1 day ago

Please remove me as an author, since I did not write this paper.

dexhunter

Paper author 1 day ago

I thought the paper is self-claimed and I don't have the access/permission to edit the authors. I've contacted the hugging face support team for the matter. 1

panikov

1 day ago

thanks. Future of AI is an iterative world model loss reduction (with regard to sensory observations like video, etc) by another teaching model, any by expanding logic correlations between items of world model

librarian-bot

1 day ago

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any Paper on Hugging Face checkout this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Models citing this paper 1

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2502.13138 in a dataset README.md to link it from this page.

Spaces citing this paper 1

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.