|
--- |
|
language: |
|
- en |
|
license: apache-2.0 |
|
benchmarks: |
|
- ChatDoc/OCRFlux-bench-single |
|
- ChatDoc/OCRFlux-bench-cross |
|
- ChatDoc/OCRFlux-pubtabnet-single |
|
- ChatDoc/OCRFlux-pubtabnet-cross |
|
base_model: |
|
- Qwen/Qwen2.5-VL-3B-Instruct |
|
library_name: transformers |
|
--- |
|
|
|
|
|
# OCRFlux-3B |
|
|
|
This is a preview release of the OCRFlux-3B model that's fine tuned from Qwen2.5-VL-3B-Instruct using the our private document datasets and some data from |
|
[olmOCR-mix-0225](https://huggingface.co/datasets/allenai/olmOCR-mix-0225) dataset. |
|
|
|
Quick links: |
|
- 🛠️ [Code](https://github.com/chatdoc-com/OCRFlux) |
|
|
|
## Usage |
|
|
|
The best way to use this model is via the [OCRFlux toolkit](https://github.com/chatdoc-com/OCRFlux). |
|
The toolkit comes with an efficient inference setup via vllm that can handle millions of documents |
|
at scale. |
|
|
|
## License and use |
|
|
|
OCRFlux is licensed under the Apache 2.0 license. |
|
OCRFlux is intended for research and educational use. |
|
|