Commit History
Fix bug
8ad1a09
xeon27
commited on
Add separate tab for agentic benchmark
1d1f5e9
xeon27
commited on
Use dash symbol for markdown
0796d85
xeon27
commited on
Use dash symbol for markdown
a319d81
xeon27
commited on
Fix bug
e7a2635
xeon27
commited on
Log df shape
f066ed8
xeon27
commited on
Fix bug
b1f9063
xeon27
commited on
Log df shape
116683a
xeon27
commited on
Add '-' for empty results
8555000
xeon27
commited on
Fix bug
323e17d
xeon27
commited on
Fix bug
e7fe9f8
xeon27
commited on
Remove column for average
a2f2df3
xeon27
commited on
Replace missing values by None
18638a9
xeon27
commited on
Change extension of web log file to .eval
cd53742
xeon27
commited on
Remove links to col names due to issues
cdca101
xeon27
commited on
Make task names clickable and link to inspect-evals repo
36244aa
xeon27
commited on
Clean up
2a314d2
xeon27
commited on
Fix bug
a2189ab
xeon27
commited on
Fix bug
ca19cea
xeon27
commited on
Fix bug
346f5e5
xeon27
commited on
Make values clickable
bbde2b0
xeon27
commited on
Debug
2c5e9d1
xeon27
commited on
Debug
c054278
xeon27
commited on
Debug
7c6bd6c
xeon27
commited on
Debug
3a37ec7
xeon27
commited on
Debug
d7d56ae
xeon27
commited on
Remove debug code
40ac9c7
xeon27
commited on
Debug
dea22be
xeon27
commited on