rohansampath commited on
Commit
164123e
·
verified ·
1 Parent(s): 49b1770

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -1,5 +1,5 @@
1
  ---
2
- title: Evaluations on the MMLU-Pro (2024) Dataset
3
  emoji: 🦀
4
  colorFrom: indigo
5
  colorTo: blue
@@ -8,9 +8,9 @@ sdk_version: 5.16.1
8
  app_file: app.py
9
  pinned: false
10
  license: apache-2.0
11
- short_description: Evaluates various models on the MMLU-Pro Dataset.
12
  ---
13
- This Space replicates the evaluation of various models on the MMLU-Pro Dataset.
14
  Dataset: https://huggingface.co/datasets/TIGER-Lab/MMLU-Pro
15
  GitHub: https://github.com/TIGER-AI-Lab/MMLU-Pro
16
  Paper: https://arxiv.org/abs/2406.01574 (Submitted at NeurIPS 2024)
 
1
  ---
2
+ title: Head to Head Evaluations Comparator
3
  emoji: 🦀
4
  colorFrom: indigo
5
  colorTo: blue
 
8
  app_file: app.py
9
  pinned: false
10
  license: apache-2.0
11
+ short_description: Evaluates 2 models or 1 model w/diff configs on a dataset
12
  ---
13
+ This Space replicates the evaluation of different models on various datasets.
14
  Dataset: https://huggingface.co/datasets/TIGER-Lab/MMLU-Pro
15
  GitHub: https://github.com/TIGER-AI-Lab/MMLU-Pro
16
  Paper: https://arxiv.org/abs/2406.01574 (Submitted at NeurIPS 2024)