Explore AI agent benchmark and submit results
Configurable Generalist Agent, leader in AppWorld Benchmark