runtime error
Exit code: 1. Reason: Value(dtype='string', id=None), 'agent_name': Value(dtype='string', id=None), 'model_family': Value(dtype='string', id=None), 'organisation': Value(dtype='string', id=None), 'repo_url': Value(dtype='string', id=None), 'date': Value(dtype='string', id=None), 'validated': Value(dtype='bool', id=None)} because column names don't match During handling of the above exception, another exception occurred: Traceback (most recent call last): File "/home/user/app/app.py", line 29, in <module> refresh(only_leaderboard=False) File "/home/user/app/dabstep_benchmark/leaderboard.py", line 40, in refresh DATASETS[f"{config_name}"] = load_dataset( File "/usr/local/lib/python3.10/site-packages/datasets/load.py", line 2084, in load_dataset builder_instance.download_and_prepare( File "/usr/local/lib/python3.10/site-packages/datasets/builder.py", line 925, in download_and_prepare self._download_and_prepare( File "/usr/local/lib/python3.10/site-packages/datasets/builder.py", line 1001, in _download_and_prepare self._prepare_split(split_generator, **prepare_split_kwargs) File "/usr/local/lib/python3.10/site-packages/datasets/builder.py", line 1742, in _prepare_split for job_id, done, content in self._prepare_split_single( File "/usr/local/lib/python3.10/site-packages/datasets/builder.py", line 1873, in _prepare_split_single raise DatasetGenerationCastError.from_cast_error( datasets.exceptions.DatasetGenerationCastError: An error occurred while generating the dataset All the data files must have the same columns, but at some point there are 1 new columns ({'system_prompt'}) This happened while the json dataset builder was generating data using hf://datasets/adyen/DABstep/data/submissions/v1__Adyen-Claude 3.5 Generic Agent__23-01-2025.jsonl (at revision 33bfc3024e71da593e37d7d2c2dc4fb20ecddfee) Please either edit the data files to have matching columns, or separate them into different configurations (see docs at https://hf.co/docs/hub/datasets-manual-configuration#multiple-configurations)
Container logs:
Fetching error logs...