Spaces:
Running
Running
name,zero_self_con,zero_cot_self_con,few_self_con,few_cot_self_con | |
Aquilachat2-34B,36.63,44.83,46.65, | |
Baichuan-13B-Chat,11.67,19.52,22.54,28.77 | |
Baichuan2-13B-Chat,19.1,22.9,26.5,24.5 | |
Chatglm2-6B,20.52,19.72,20.12,22.74 | |
Chatglm3-6B,20.92555332,25.15090543,24.74849095,29.1750503 | |
Chinese-Alpaca-2-13B,23.14,28.97,16.3,14.29 | |
Chinese-Llama-2-13B,13.88,20.52,16.9,23.34 | |
Devops-Model-14B-Chat,26.96,38.83,34.81,27.36 | |
Ernie-Bot-4.0,43.8,47.14,46.0,54.0 | |
Gpt-3.5-Turbo,38.83,42.05,37.63,43.86 | |
Gpt-4,,64.56,,62.58 | |
Internlm-7B,26.36,25.55,25.55,27.97 | |
Internlm2-Chat-20B,,59.21052632,, | |
Internlm2-Chat-7B,27.16297787,28.16901408,29.97987928,30.18108652 | |
Llama-2-13B,20.32,29.58,22.33,33.8 | |
Llama-2-70B-Chat,19.72,27.97,26.56,32.6 | |
Llama-2-7B,23.74,26.56,20.52,33.6 | |
Mistral-7B,17.1,26.76,31.19,27.97 | |
Qwen-14B-Chat,28.37,36.62,28.37,24.14 | |
Qwen-72B-Chat,47.48,48.09,49.7,43.66 | |
Qwen-7B-Chat,19.11,23.94,25.55,33.4 | |
Yi-34B-Chat,48.69,46.28,58.35,58.95 | |
Claude-3-Opus,48.31816996021653,,, | |
gemma_2b,16.90141,19.5171,16.09658,24.74849 | |
gemma_7b,14.28571,30.98592,2.60223,43.85965 | |
Meta-Llama-3-8B-Instruct,28.468825409248026,40.47805387073632,23.33528989760647,34.6197743429205 | |
Qwen1.5-14B-Base,29.17505,33.60161,36.82093,27.7666 | |
Qwen1.5-14B-Chat,35.41247,43.05835,33.60161,38.833 | |