hal

community
https://github.com/benediktstroebl/agent-eval-harness/tree/main
benediktstroebl

AI & ML interests

None defined yet.

Recent Activity

benediktstroebl  updated a dataset 5 days ago
agent-evals/hal_traces
Peterkirgis  updated a dataset 9 days ago
agent-evals/hal_traces
siegelz  updated a dataset 11 days ago
agent-evals/hal_traces

Members: Benedikt Stroebl, Sayash Kapoor, Arvind Narayanan, Zachary Siegel, Boyi Wei, Peter Kirgis, wave, Ziru Chen, Yifei Zhou

spaces 2

Agent Leaderboard 🏆 (Running)
Display agent leaderboards for various benchmarks
Dec 5, 2024

Agent Leaderboard 🏆 (Running)
Nov 18, 2024

models 0

None public yet

datasets 3

agent-evals/hal_traces

Updated 5 days ago • 208

agent-evals/agent_traces

Updated Apr 6 • 735

agent-evals/results

Updated Jan 16 • 15