hal

community
https://github.com/benediktstroebl/agent-eval-harness/tree/main
benediktstroebl

AI & ML interests

None defined yet.

Recent Activity

benediktstroebl  updated a dataset 3 days ago
agent-evals/hal_traces
xuetianci99  updated a dataset 14 days ago
agent-evals/hal_traces
Peterkirgis  updated a dataset 18 days ago
agent-evals/hal_traces

Members: Sayash Kapoor, xuetianci, Ziru Chen, Zachary Siegel, Yifei Zhou, Boyi Wei, Benedikt Stroebl, wave, Arvind Narayanan, Peter Kirgis

spaces 2

🏆 Agent Leaderboard (Running)
Display agent leaderboards for various benchmarks
Dec 5, 2024

🏆 Agent Leaderboard (Running)
Nov 18, 2024

models 0

None public yet

datasets 3

agent-evals/hal_traces

Updated 3 days ago • 450

agent-evals/agent_traces

Updated Apr 6 • 3.24k

agent-evals/results

Updated Jan 16 • 37