Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated fromย
benediktstroebl/hal
agent-evals
/
core_leaderboard
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
f9c6a2b
core_leaderboard
/
utils
Ctrl+K
Ctrl+K
3 contributors
History:
10 commits
benediktstroebl
updated width of plot
be40ce5
10 months ago
data.py
Safe
9.47 kB
format update and added monitor llm client backend
10 months ago
pareto.py
Safe
1.34 kB
big update with raw predictions section and dropdowns that dynamically parse agents of current leaderboard
10 months ago
processing.py
Safe
5.97 kB
update to avoid automatic processing
10 months ago
viz.py
Safe
8.58 kB
updated width of plot
10 months ago