Remove processed item from eval queue b79aa91 verified karimouda committed on about 7 hours ago
Add responses file for Qwen3-4B-Thinking-2507 d889832 verified karimouda committed on about 7 hours ago
update requests file for Qwen3-4B-Thinking-2507 e0f8060 verified karimouda committed on about 7 hours ago
Add Qwen/Qwen3-4B-Thinking-2507 request file dea4e3d verified karimouda committed on about 15 hours ago
Add Qwen/Qwen3-4B-Thinking-2507 to eval queue e7b5fa8 verified karimouda committed on about 15 hours ago
Add responses file for Qwen3-4B-Instruct-2507 798749a verified karimouda committed on about 15 hours ago
Add results file for Qwen3-4B-Instruct-2507 13dae5f verified karimouda committed on about 15 hours ago
update requests file for Qwen3-4B-Instruct-2507 a3945d4 verified karimouda committed on about 15 hours ago
Add Qwen/Qwen3-4B-Instruct-2507 request file 3731843 verified karimouda committed on about 16 hours ago
Add Qwen/Qwen3-4B-Instruct-2507 to eval queue e52698e verified karimouda committed on about 16 hours ago
Delete requests/Qwen/Qwen3-4B-Thinking-2507_eval_request.json 4c2875d verified karimouda committed on about 16 hours ago
Delete results/Qwen/Qwen3-4B-Thinking-2507_abb_benchmark_answers_2025-08-11_02-12-21.html 1377b83 verified karimouda committed on about 16 hours ago
Delete results/Qwen/Qwen3-4B-Thinking-2507_results_2025-08-11_02-12-18.json 100def8 verified karimouda committed on about 16 hours ago
Add responses file for Qwen3-4B-Thinking-2507 057cc73 verified karimouda committed on about 20 hours ago
Add results file for Qwen3-4B-Thinking-2507 f43cc7a verified karimouda committed on about 20 hours ago
update requests file for Qwen3-4B-Thinking-2507 d16f549 verified karimouda committed on about 20 hours ago
Add Qwen/Qwen3-4B-Thinking-2507 request file 5938da5 verified karimouda committed on about 24 hours ago
Add Qwen/Qwen3-4B-Thinking-2507 to eval queue 709fd31 verified karimouda committed on about 24 hours ago
Add ibm-granite/granite-3.3-8b-instruct request file e3a56da verified karimouda committed on 2 days ago
Add ibm-granite/granite-3.3-8b-instruct to eval queue dd773ea verified karimouda committed on 2 days ago
Update results/openai/gpt-5-2025-08-07_results_2025-08-07_22-52-43.json 7916e24 verified karimouda committed on 4 days ago