Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
agent-evals
/
leaderboard
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
201da5d
leaderboard
Ctrl+K
Ctrl+K
1 contributor
History:
11 commits
benediktstroebl
added appworld gaia cybench.
201da5d
6 months ago
agent_monitor
minor tweaks
7 months ago
utils
added appworld gaia cybench.
6 months ago
.gitattributes
Safe
1.58 kB
Upload preprocessed_traces.db
7 months ago
.gitignore
Safe
74 Bytes
init v1
7 months ago
README.md
Safe
236 Bytes
init v1
7 months ago
about.md
Safe
5.39 kB
init v1
7 months ago
agent_performance_analysis.json
Safe
5.08 kB
init v1
7 months ago
agent_submission.md
Safe
766 Bytes
init v1
7 months ago
agent_submission_core.md
Safe
2.77 kB
init v1
7 months ago
app.py
Safe
90.3 kB
added appworld gaia cybench.
6 months ago
benchmark_submission.md
Safe
496 Bytes
init v1
7 months ago
config.py
Safe
3.77 kB
added appworld gaia cybench.
6 months ago
css.css
Safe
936 Bytes
init v1
7 months ago
envs.py
Safe
191 Bytes
init v1
7 months ago
hal.ico
Safe
15.4 kB
init v1
7 months ago
hal.png
Safe
1.03 kB
init v1
7 months ago
header.md
Safe
118 Bytes
init v1
7 months ago
preprocessed_traces.db
Safe
1.95 GB
LFS
Upload preprocessed_traces.db
7 months ago
requirements.txt
Safe
1.84 kB
init v1
7 months ago
verified_agents.yaml
Safe
4.96 kB
added appworld gaia cybench.
6 months ago