File size: 926 Bytes
7a9d91b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
333e5e6
7a9d91b
333e5e6
7a9d91b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
---
language:
- en
license: apache-2.0
benchmarks:
- ChatDoc/OCRFlux-bench-single
- ChatDoc/OCRFlux-bench-cross
- ChatDoc/OCRFlux-pubtabnet-single
- ChatDoc/OCRFlux-pubtabnet-cross
base_model:
- Qwen/Qwen2.5-VL-3B-Instruct
library_name: transformers
---


# OCRFlux-3B

This is a preview release of the OCRFlux-3B model that's fine tuned from Qwen2.5-VL-3B-Instruct using the our private document datasets and some data from
[olmOCR-mix-0225](https://huggingface.co/datasets/allenai/olmOCR-mix-0225) dataset.

Quick links:
- 🛠️ [Code](https://github.com/chatdoc-com/OCRFlux)

## Usage

The best way to use this model is via the [OCRFlux toolkit](https://github.com/chatdoc-com/OCRFlux).
The toolkit comes with an efficient inference setup via vllm that can handle millions of documents
at scale.

## License and use

OCRFlux is licensed under the Apache 2.0 license.
OCRFlux is intended for research and educational use.