Added computation and display of the standard deviation across individual prompt accuracy values for each task (67324c2) rzanoli committed 22 days ago
Duplicate from demo-leaderboard-backend/leaderboard (6b0f21c, verified) evalitahf and clefourrier (HF Staff) committed on Mar 13
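
The commit listing for 67324c2 does not include the implementation itself. As a rough illustration of what "standard deviation across individual prompt accuracy values for each task" could mean, here is a minimal sketch. The function name `summarize_prompt_accuracies`, the input layout (a dict mapping task name to a list of per-prompt accuracies), and the task names and numbers are all hypothetical. The use of the sample standard deviation (`statistics.stdev`) is also an assumption; the actual commit may use the population form (`statistics.pstdev`) instead.

```python
from statistics import mean, stdev


def summarize_prompt_accuracies(per_task):
    """Return mean and standard deviation of prompt accuracies per task.

    `per_task` maps a task name to a list of accuracy values, one per
    prompt. This layout is an assumption made for illustration only.
    """
    summary = {}
    for task, accuracies in per_task.items():
        # The sample standard deviation needs at least two values;
        # fall back to 0.0 for a task evaluated with a single prompt.
        spread = stdev(accuracies) if len(accuracies) > 1 else 0.0
        summary[task] = {"mean": mean(accuracies), "std": spread}
    return summary


# Hypothetical accuracies for illustration; not real leaderboard results.
example = {"task_a": [60.0, 62.0, 64.0], "task_b": [50.0]}
result = summarize_prompt_accuracies(example)
for task, stats in result.items():
    print(f"{task}: mean={stats['mean']:.2f} std={stats['std']:.2f}")
```

A small spread suggests the model is robust to prompt wording on that task, while a large one suggests the reported mean depends heavily on which prompt was used.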