Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
LLM360
/
k2-eval-gallery
like
5
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
92274b2
k2-eval-gallery
/
eval-results
Ctrl+K
Ctrl+K
3 contributors
History:
5 commits
mylibrar
Format evaluation results
9868b53
12 months ago
arc_challenge
Format evaluation results
12 months ago
arc_easy
Format evaluation results
12 months ago
bbh_cot_fewshot
Format evaluation results
12 months ago
crowspairs
Format evaluation results
12 months ago
gsm8k
Format evaluation results
12 months ago
hellaswag
Format evaluation results
12 months ago
humaneval
Format evaluation results
12 months ago
logiqa2
Format evaluation results
12 months ago
mathqa
Format evaluation results
12 months ago
mbpp
Format evaluation results
12 months ago
medmcqa
Format evaluation results
12 months ago
medqa
Format evaluation results
12 months ago
mmlu
Format evaluation results
12 months ago
openbookqa5
Format evaluation results
12 months ago
piqa5
Format evaluation results
12 months ago
pubmedqa
Format evaluation results
12 months ago
race
Format evaluation results
12 months ago
toxigen
Format evaluation results
12 months ago
toxigen2
Format evaluation results
12 months ago
truthfulqa_mc2
Format evaluation results
12 months ago
winogrande5
Format evaluation results
12 months ago