Jae-Won Chung
New leaderboard prototype
b10121d
raw
history blame contribute delete
321 Bytes
{
"Model": "bigcode/starcoder2-7b",
"GPU": "NVIDIA H100 80GB HBM3",
"TP": 1,
"PP": 1,
"Energy/req (J)": 14.092466361237385,
"Avg TPOT (s)": 0.1081798475988885,
"Token tput (tok/s)": 1930.0394156746474,
"Avg Output Tokens": 90.10365853658537,
"Avg BS (reqs)": 492.84644194756555,
"Max BS (reqs)": 512
}