Spaces:

Duplicated from benediktstroebl/hal

agent-evals/core_leaderboard
Running

  • 3 contributors
History: 150 commits
Latest commit: Zachary Siegel, "updatae db" (eadf8af), 6 months ago
  • agent_monitor
    Big update with SQL backend 9 months ago
  • evals_live
    fix typo in agent name 6 months ago
  • evals_processed
    init files to keep dirs open 10 months ago
  • evals_upload
    init files to keep dirs open 10 months ago
  • utils
    add results to leaderboard 8 months ago
  • .gitattributes
    2.05 kB
    Upload preprocessed_traces.db 9 months ago
  • .gitignore
    115 Bytes
    update corebench results 6 months ago
  • README copy.md
    14.7 kB
    init 10 months ago
  • README.md
    236 Bytes
    initial commit 10 months ago
  • about.md
    5.39 kB
    Upload 3 files 9 months ago
  • agent_submission.md
    2.76 kB
    submit to any of the three levels 8 months ago
  • app.py
    18.7 kB
    update title 8 months ago
  • benchmark_submission.md
    496 Bytes
    Upload 3 files 9 months ago
  • config.py
    1.37 kB
    added first agent to leaderboard 8 months ago
  • css.css
    997 Bytes
    vis update 9 months ago
  • envs.py
    191 Bytes
    added auto update 10 months ago
  • hal.ico
    15.4 kB
    Upload 5 files 9 months ago
  • hal.png
    1.03 kB
    Upload 5 files 9 months ago
  • header.md
    118 Bytes
    vis update 9 months ago
  • preprocessed_traces.db
    128 MB (LFS)
    updatae db 6 months ago
  • requirements.txt
    1.86 kB
    Upload requirements.txt 9 months ago
  • scratch.py
    1.61 kB
    vis update 9 months ago
  • verified_agents.yaml
    1.3 kB
    verify o1 mini 6 months ago