CP-Bench-Leaderboard / src /hf_utils.py

Commit History

Sort leaderboard entries by "Final Solution Accuracy" in hf_utils.py
43dd2bb
Running

kostis-init commited on

Fix indexing for leaderboard columns in hf_utils.py
a6f5bd8

kostis-init commited on

Add base LLM and modelling framework to submission metadata; update leaderboard columns
444cb2e

kostis-init commited on

Refactor evaluation logic: streamline user_eval.py, update evaluation script references, and clean up eval.py
70cc330

kostis-init commited on

update leaderboard entry parsing, enhance LLM client configuration, and correct submission file link
60a95c1

kostis-init commited on

update leaderboard columns and enhance evaluation summary reporting
2e2392c

kostis-init commited on

change submission from directory to a single jsonl file
21ed616

kostis-init commited on

add extra hf dataset for persistent storage of submissions and results
180f9fe

kostis-init commited on