Spaces: LLM360/k2-eval-gallery
k2-eval-gallery/eval-results
3 contributors
History: 4 commits
Latest commit: mylibrar, "Upload other metrics" (c33c8c6), about 1 year ago
Name             Last commit message                                Last updated
arc_challenge    Upload other metrics                               about 1 year ago
arc_easy         Upload other metrics                               about 1 year ago
bbh_cot_fewshot  Upload other metrics                               about 1 year ago
crowspairs       Upload 3 metrics                                   about 1 year ago
gsm8k            Upload other metrics                               about 1 year ago
hellaswag        Upload other metrics                               about 1 year ago
humaneval        Upload results for 3 metrics                       about 1 year ago
logiqa2          Upload other metrics                               about 1 year ago
mathqa           Upload other metrics                               about 1 year ago
mbpp             Upload results for 3 metrics                       about 1 year ago
medmcqa          Upload other metrics                               about 1 year ago
medqa            Upload more metrics and fix some issues in app.py  about 1 year ago
mmlu             Upload other metrics                               about 1 year ago
openbookqa5      Upload 3 metrics                                   about 1 year ago
piqa5            Upload more metrics and fix some issues in app.py  about 1 year ago
pubmedqa         Upload more metrics and fix some issues in app.py  about 1 year ago
race             Upload other metrics                               about 1 year ago
toxigen          Upload results for 3 metrics                       about 1 year ago
toxigen2         Upload more metrics and fix some issues in app.py  about 1 year ago
truthfulqa_mc2   Upload other metrics                               about 1 year ago
winogrande5      Upload 3 metrics                                   about 1 year ago