Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:

Duplicated fromย  benediktstroebl/hal

agent-evals
/
core_leaderboard
Running

App Files Files Community
Fetching metadata from the HF Docker repository...
core_leaderboard
Ctrl+K
Ctrl+K
  • 3 contributors
History: 77 commits
benediktstroebl's picture
benediktstroebl
Upload swebench_verified_Agentless_gpt-4o-mini-2024-07-18_50_Instances_1723916965.json
01fb261 verified 10 months ago
  • agent_monitor
    added timestamp to task summary prompt for failure report and fixed failure report gradio issue 10 months ago
  • evals_live
    Upload swebench_verified_Agentless_gpt-4o-mini-2024-07-18_50_Instances_1723916965.json 10 months ago
  • evals_processed
    init files to keep dirs open 10 months ago
  • evals_upload
    init files to keep dirs open 10 months ago
  • utils
    added failure report and two new swebench variants 10 months ago
  • .gitattributes
    1.99 kB
    update 10 months ago
  • .gitignore
    104 Bytes
    Update .gitignore 10 months ago
  • README copy.md
    14.7 kB
    init 10 months ago
  • README.md
    236 Bytes
    initial commit 10 months ago
  • about.md
    36 Bytes
    update 10 months ago
  • app.py
    28.1 kB
    added timestamp to task summary prompt for failure report and fixed failure report gradio issue 10 months ago
  • config.py
    722 Bytes
    layout update 10 months ago
  • css.css
    2.54 kB
    init 10 months ago
  • envs.py
    191 Bytes
    added auto update 10 months ago
  • requirements.txt
    1.85 kB
    update 10 months ago