metadata
language:
- en
license: apache-2.0
benchmarks:
- ChatDoc/OCRFlux-bench-single
- ChatDoc/OCRFlux-bench-cross
- ChatDoc/OCRFlux-pubtabnet-single
- ChatDoc/OCRFlux-pubtabnet-cross
base_model:
- Qwen/Qwen2.5-VL-3B-Instruct
library_name: transformers
OCRFlux-3B
This is a preview release of the OCRFlux-3B model that's fine tuned from Qwen2.5-VL-3B-Instruct using the our private document datasets and some data from olmOCR-mix-0225 dataset.
Quick links:
- 🛠️ Code
Usage
The best way to use this model is via the OCRFlux toolkit. The toolkit comes with an efficient inference setup via vllm that can handle millions of documents at scale.
License and use
OCRFlux is licensed under the Apache 2.0 license. OCRFlux is intended for research and educational use.