Spaces: Running on CPU Upgrade
Gregor Betz committed: description

src/display/about.py (+6 / -17)
src/display/about.py
CHANGED
@@ -53,23 +53,12 @@ Performance leaderboards like the [🤗 Open LLM Leaderboard](https://huggingfac
 
 Unlike these leaderboards, the `/\/` Open CoT Leaderboard assesses a model's ability to effectively reason about a `task`:
 
-### 🤗 Open LLM Leaderboard
-
-
-
-
-
-* Measures `task` performance.
-* Metric: absolute accuracy.
-* Covers broad spectrum of `tasks`.
-
-### `/\/` Open CoT Leaderboard
-* Can `model` do CoT to improve in `task`?
-* Measures ability to reason (about `task`).
-* Metric: relative accuracy gain.
-* Focuses on critical thinking `tasks`.
-
-
+| 🤗 Open LLM Leaderboard | `/\/` Open CoT Leaderboard |
+|:---|:---|
+| Can `model` solve `task`? | Can `model` do CoT to improve in `task`? |
+| Measures `task` performance. | Measures ability to reason (about `task`). |
+| Metric: absolute accuracy. | Metric: relative accuracy gain. |
+| Covers broad spectrum of `tasks`. | Focuses on critical thinking `tasks`. |
 
 
 ## Test dataset selection (`tasks`)
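The two metrics contrasted in the added table can be sketched in a few lines of Python. This is an illustrative sketch, not code from the repository: the function names and inputs are hypothetical, and it assumes "relative accuracy gain" means the plain difference between CoT and baseline accuracy (the leaderboard's exact definition may differ).

```python
# Illustrative sketch only -- names and the gain definition are assumptions,
# not taken from the Open CoT Leaderboard codebase.

def accuracy(correct: list[bool]) -> float:
    """Absolute accuracy: fraction of task items answered correctly."""
    return sum(correct) / len(correct)

def relative_accuracy_gain(base_correct: list[bool],
                           cot_correct: list[bool]) -> float:
    """How much chain-of-thought improves over answering directly."""
    return accuracy(cot_correct) - accuracy(base_correct)

# Per-item correctness with and without CoT prompting (toy data):
base = [True, False, False, True]   # 2/4 correct without CoT
cot = [True, True, False, True]     # 3/4 correct with CoT

print(accuracy(base))                     # 0.5
print(relative_accuracy_gain(base, cot))  # 0.25
```

The point of the contrast: a model can score high on absolute accuracy while showing zero (or negative) gain from reasoning, which is exactly what the CoT leaderboard's relative metric is designed to surface.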