Visualize agent interactions with WebArena tasks
Display and analyze evaluation results for agents
SafeArena Leaderboard
Plan a travel itinerary with budget tracking
Display and submit evaluation results for travel planning