Spaces: mib-bench/leaderboard
main: leaderboard/eval-results-mib-causalgraph/submissions
5 contributors
History: 1 commit
Latest commit: Aaron Mueller, "updated filtering, add F= tab" (1d8e193), 6 months ago
File | Scan | Size | Last commit | Updated
MCQA_results_Qwen_correct_choice_period_token.json | Safe | 40.5 kB | updated filtering, add F= tab | 6 months ago
MCQA_results_Qwen_last_correct_choice_token.json | Safe | 39.1 kB | updated filtering, add F= tab | 6 months ago
MCQA_results_Qwen_last_token.json | Safe | 39.6 kB | updated filtering, add F= tab | 6 months ago
MCQA_results_Qwen_second_to_last_token.json | Safe | 39.1 kB | updated filtering, add F= tab | 6 months ago
MCQA_results_google_correct_choice_period_token.json | Safe | 43.5 kB | updated filtering, add F= tab | 6 months ago
MCQA_results_google_correct_choice_token.json | Safe | 42.8 kB | updated filtering, add F= tab | 6 months ago
MCQA_results_google_last_token.json | Safe | 43 kB | updated filtering, add F= tab | 6 months ago
MCQA_results_google_second_to_last_token.json | Safe | 42.8 kB | updated filtering, add F= tab | 6 months ago
MCQA_results_meta-llama_correct_choice_period_token.json | Safe | 54.3 kB | updated filtering, add F= tab | 6 months ago
MCQA_results_meta-llama_correct_choice_token.json | Safe | 53.2 kB | updated filtering, add F= tab | 6 months ago
MCQA_results_meta-llama_last_token.json | Safe | 53.5 kB | updated filtering, add F= tab | 6 months ago
MCQA_results_meta-llama_second_to_last_token.json | Safe | 53.2 kB | updated filtering, add F= tab | 6 months ago