Add dataset versioning support and update leaderboard configuration bea3aa3 kostis-init commited on Aug 8
Refactor evaluation logic: streamline user_eval.py, update evaluation script references, and clean up eval.py 70cc330 kostis-init commited on Jun 6
update leaderboard columns and enhance evaluation summary reporting 2e2392c kostis-init commited on Jun 5
remove unused constants and redundant imports; update Dockerfile dependencies 5e53d23 kostis-init commited on May 27
add extra hf dataset for persistent storage of submissions and results 180f9fe kostis-init commited on May 21