Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
agent-evals
/
leaderboard
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
512799d
leaderboard
Ctrl+K
Ctrl+K
1 contributor
History:
9 commits
benediktstroebl
hide swebench lite and mlagentbench
512799d
9 months ago
agent_monitor
minor tweaks
9 months ago
utils
minor tweaks
9 months ago
.gitattributes
Safe
1.58 kB
Upload preprocessed_traces.db
9 months ago
.gitignore
Safe
74 Bytes
init v1
9 months ago
README.md
Safe
236 Bytes
init v1
9 months ago
about.md
Safe
5.39 kB
init v1
9 months ago
agent_performance_analysis.json
Safe
5.08 kB
init v1
9 months ago
agent_submission.md
Safe
766 Bytes
init v1
9 months ago
agent_submission_core.md
Safe
2.77 kB
init v1
9 months ago
app.py
Safe
82.2 kB
hide swebench lite and mlagentbench
9 months ago
benchmark_submission.md
Safe
496 Bytes
init v1
9 months ago
config.py
Safe
2.07 kB
init v1
9 months ago
css.css
Safe
936 Bytes
init v1
9 months ago
envs.py
Safe
191 Bytes
init v1
9 months ago
hal.ico
Safe
15.4 kB
init v1
9 months ago
hal.png
Safe
1.03 kB
init v1
9 months ago
header.md
Safe
118 Bytes
init v1
9 months ago
preprocessed_traces.db
Safe
1.95 GB
LFS
Upload preprocessed_traces.db
9 months ago
requirements.txt
Safe
1.84 kB
init v1
9 months ago
scratch.ipynb
Safe
0 Bytes
init v1
9 months ago
scratch.py
Safe
1.61 kB
init v1
9 months ago
verified_agents.yaml
Safe
3.94 kB
minor tweaks
9 months ago