Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
agent-evals
/
leaderboard
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
201da5d
leaderboard
Ctrl+K
Ctrl+K
1 contributor
History:
11 commits
benediktstroebl
added appworld gaia cybench.
201da5d
9 months ago
agent_monitor
minor tweaks
10 months ago
utils
added appworld gaia cybench.
9 months ago
.gitattributes
Safe
1.58 kB
Upload preprocessed_traces.db
10 months ago
.gitignore
Safe
74 Bytes
init v1
10 months ago
README.md
Safe
236 Bytes
init v1
10 months ago
about.md
Safe
5.39 kB
init v1
10 months ago
agent_performance_analysis.json
Safe
5.08 kB
init v1
10 months ago
agent_submission.md
Safe
766 Bytes
init v1
10 months ago
agent_submission_core.md
Safe
2.77 kB
init v1
10 months ago
app.py
Safe
90.3 kB
added appworld gaia cybench.
9 months ago
benchmark_submission.md
Safe
496 Bytes
init v1
10 months ago
config.py
Safe
3.77 kB
added appworld gaia cybench.
9 months ago
css.css
Safe
936 Bytes
init v1
10 months ago
envs.py
Safe
191 Bytes
init v1
10 months ago
hal.ico
Safe
15.4 kB
init v1
10 months ago
hal.png
Safe
1.03 kB
init v1
10 months ago
header.md
Safe
118 Bytes
init v1
10 months ago
preprocessed_traces.db
Safe
1.95 GB
LFS
Upload preprocessed_traces.db
10 months ago
requirements.txt
Safe
1.84 kB
init v1
10 months ago
verified_agents.yaml
Safe
4.96 kB
added appworld gaia cybench.
9 months ago