Spaces:
Running
Running
name,zero_self_con,zero_cot_self_con,few_self_con,few_cot_self_con | |
Aquilachat2-34B,36.63,44.83,46.65, | |
Baichuan-13B-Chat,20.4,37.0,26.7,17.8 | |
Baichuan2-13B-Chat,15.3,25.8,33.1,27.7 | |
Chatglm2-6B,24.7,36.5,37.6,40.5 | |
Chatglm3-6B,43.38487973,44.58762887,42.09621993,43.47079038 | |
Chinese-Alpaca-2-13B,37.7,49.7,48.6,50.5 | |
Chinese-Llama-2-13B,29.4,37.8,40.4,28.8 | |
Devops-Model-14B-Chat,30.59,63.63,61.96,44.01 | |
Ernie-Bot-4.0,61.15,70.0,60.0,70.0 | |
Gpt-3.5-Turbo,66.8,72.0,68.3,72.5 | |
Gpt-4,,,,88.7 | |
Internlm-7B,38.7,43.9,45.2,51.4 | |
Internlm2-Chat-20B,56.35738832,26.18025751,60.48109966,45.10309278 | |
Internlm2-Chat-7B,49.74226804,56.18556701,48.19587629,49.74226804 | |
Llama-2-13B,46.5,58.7,53.0,61.0 | |
Llama-2-70B-Chat,25.29,58.06,52.97,58.55 | |
Llama-2-7B,40.0,49.5,46.8,55.2 | |
Mistral-7B,29.27,46.3,47.22,45.58 | |
Qwen-14B-Chat,47.81,59.4,59.7,55.88 | |
Qwen-72B-Chat,70.5,72.56,70.32,70.22 | |
Qwen-7B-Chat,46.0,50.1,51.0,49.8 | |
Yi-34B-Chat,59.14,68.79,68.37,80.06 | |
Claude-3-Opus,69.03417341637355,,, | |
gemma_2b,26.46048,33.41924,26.6323,37.54296 | |
gemma_7b,25.08591,50.85911,30.24055,51.55747 | |
Meta-Llama-3-8B-Instruct,38.279481659390655,76.69172932330827,23.734458771084668,33.241749376506874 | |
Qwen1.5-14B-Base,34.87973,60.82474,65.54983,47.07904 | |
Qwen1.5-14B-Chat,56.4433,67.09622,53.52234,64.17526 | |