Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
LLM360
/
k2-eval-gallery
like
5
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
9868b53
k2-eval-gallery
/
eval-results
Ctrl+K
Ctrl+K
3 contributors
History:
5 commits
mylibrar
Format evaluation results
9868b53
about 1 year ago
arc_challenge
Format evaluation results
about 1 year ago
arc_easy
Format evaluation results
about 1 year ago
bbh_cot_fewshot
Format evaluation results
about 1 year ago
crowspairs
Format evaluation results
about 1 year ago
gsm8k
Format evaluation results
about 1 year ago
hellaswag
Format evaluation results
about 1 year ago
humaneval
Format evaluation results
about 1 year ago
logiqa2
Format evaluation results
about 1 year ago
mathqa
Format evaluation results
about 1 year ago
mbpp
Format evaluation results
about 1 year ago
medmcqa
Format evaluation results
about 1 year ago
medqa
Format evaluation results
about 1 year ago
mmlu
Format evaluation results
about 1 year ago
openbookqa5
Format evaluation results
about 1 year ago
piqa5
Format evaluation results
about 1 year ago
pubmedqa
Format evaluation results
about 1 year ago
race
Format evaluation results
about 1 year ago
toxigen
Format evaluation results
about 1 year ago
toxigen2
Format evaluation results
about 1 year ago
truthfulqa_mc2
Format evaluation results
about 1 year ago
winogrande5
Format evaluation results
about 1 year ago