# Beyond Document Page Classification
We release the benchmarking code together with the proposed datasets:
* https://huggingface.co/datasets/bdpc/rvl_cdip_mp
* https://huggingface.co/datasets/bdpc/rvl_cdip_n_mp
For consistency, the code is hosted on Hugging Face as an anonymous model repository that can be cloned.
## Installation
The scripts require [Python >= 3.8](https://www.python.org/downloads/release/python-380/) to run.
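Before setting anything up, it can help to confirm that the interpreter on your `PATH` meets this requirement. The following is a minimal sketch, not part of the released scripts; the variable name `py_status` is purely illustrative.

```sh
# Check that python3 exists and is at least version 3.8.
# The inline Python exits with status 0 on success and 1 otherwise.
if python3 -c 'import sys; sys.exit(0 if sys.version_info >= (3, 8) else 1)'; then
    py_status="ok"
else
    py_status="too old or missing"
fi
echo "python3 check: ${py_status}"
```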
We first create a fresh virtual environment in which to install all required packages:
```sh
mkvirtualenv -p /usr/bin/python3 BYD
```
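`mkvirtualenv` is provided by virtualenvwrapper, which may not be installed on every machine. As a hedged alternative that is not part of the original instructions, the standard-library `venv` module should give an equivalent environment. The directory name `.venv-BYD` is an assumption chosen for illustration; with this approach, sourcing the `activate` script takes the place of `workon BYD`.

```sh
# Assumed alternative to mkvirtualenv: create the environment with the stdlib venv module.
python3 -m venv .venv-BYD

# Activate it in the current shell (this replaces `workon BYD`).
. .venv-BYD/bin/activate

# VIRTUAL_ENV is set by the activate script and points at the environment directory.
echo "active environment: ${VIRTUAL_ENV}"
```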
Using Poetry and the provided `pyproject.toml`, we then install all required packages:
```sh
workon BYD
pip3 install poetry
poetry install
```
## Experiments
To replicate all experimental results from the paper, run `experiments.sh`:
```sh
./experiments.sh
```
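If the script lacks the executable bit, or you want to keep a record of the run, a small wrapper like the one below may be useful. This is only a sketch: the function name `run_experiments` and the log file naming are hypothetical and not part of the repository.

```sh
# Hypothetical helper: run a script through bash and keep a copy of its output.
run_experiments() {
    script="${1:-experiments.sh}"
    if [ ! -f "$script" ]; then
        echo "$script not found in $(pwd)" >&2
        return 1
    fi
    # Invoking through bash avoids needing `chmod +x`;
    # tee writes stdout and stderr to e.g. experiments.log while still printing them.
    bash "$script" 2>&1 | tee "${script%.sh}.log"
}
```

Calling `run_experiments` from the repository root runs `experiments.sh` and writes its output to `experiments.log`.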