Sort leaderboard entries by "Final Solution Accuracy" in hf_utils.py 43dd2bb Running kostis-init commited on 5 days ago
Add base LLM and modelling framework to submission metadata; update leaderboard columns 444cb2e kostis-init commited on 6 days ago
Refactor evaluation logic: streamline user_eval.py, update evaluation script references, and clean up eval.py 70cc330 kostis-init commited on 6 days ago
update leaderboard entry parsing, enhance LLM client configuration, and correct submission file link 60a95c1 kostis-init commited on 7 days ago
update leaderboard columns and enhance evaluation summary reporting 2e2392c kostis-init commited on 7 days ago
add extra hf dataset for persistent storage of submissions and results 180f9fe kostis-init commited on 22 days ago