Add dataset versioning support and update leaderboard configuration bea3aa3 kostis-init commited on Aug 8
Update README.md with project structure and development instructions; fix error percentage calculation in user_eval.py 2990bc2 kostis-init commited on Jun 6
Refactor evaluation logic: streamline user_eval.py, update evaluation script references, and clean up eval.py 70cc330 kostis-init commited on Jun 6