ChatDOC commited on
Commit
7a9d91b
·
verified ·
1 Parent(s): 5441386

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +33 -34
README.md CHANGED
@@ -1,34 +1,33 @@
1
- ---
2
- language:
3
- - en
4
- license: apache-2.0
5
- benchmarks:
6
- - ChatDoc/OCRFlux-bench-single
7
- - ChatDoc/OCRFlux-bench-cross
8
- - ChatDoc/OCRFlux-pubtabnet-single
9
- - ChatDoc/OCRFlux-pubtabnet-cross
10
- base_model:
11
- - Qwen/Qwen2.5-VL-3B-Instruct
12
- library_name: transformers
13
- ---
14
-
15
- <img alt="OCRFlux Logo" src="https://github.com/chatdoc-com/OCRFlux/blob/main/images/OCRFlux.png" width="242px" style="margin-left:'auto' margin-right:'auto' display:'block'">
16
-
17
- # OCRFlux
18
-
19
- This is a preview release of the OCRFlux model that's fine tuned from Qwen2.5-VL-3B-Instruct using the our private document datasets and some data from
20
- [olmOCR-mix-0225](https://huggingface.co/datasets/allenai/olmOCR-mix-0225) dataset.
21
-
22
- Quick links:
23
- - 🛠️ [Code](https://github.com/chatdoc-com/OCRFlux)
24
-
25
- ## Usage
26
-
27
- The best way to use this model is via the [OCRFlux toolkit](https://github.com/chatdoc-com/OCRFlux).
28
- The toolkit comes with an efficient inference setup via vllm that can handle millions of documents
29
- at scale.
30
-
31
- ## License and use
32
-
33
- OCRFlux is licensed under the Apache 2.0 license.
34
- OCRFlux is intended for research and educational use.
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: apache-2.0
5
+ benchmarks:
6
+ - ChatDoc/OCRFlux-bench-single
7
+ - ChatDoc/OCRFlux-bench-cross
8
+ - ChatDoc/OCRFlux-pubtabnet-single
9
+ - ChatDoc/OCRFlux-pubtabnet-cross
10
+ base_model:
11
+ - Qwen/Qwen2.5-VL-3B-Instruct
12
+ library_name: transformers
13
+ ---
14
+
15
+
16
+ # OCRFlux
17
+
18
+ This is a preview release of the OCRFlux model that's fine tuned from Qwen2.5-VL-3B-Instruct using the our private document datasets and some data from
19
+ [olmOCR-mix-0225](https://huggingface.co/datasets/allenai/olmOCR-mix-0225) dataset.
20
+
21
+ Quick links:
22
+ - 🛠️ [Code](https://github.com/chatdoc-com/OCRFlux)
23
+
24
+ ## Usage
25
+
26
+ The best way to use this model is via the [OCRFlux toolkit](https://github.com/chatdoc-com/OCRFlux).
27
+ The toolkit comes with an efficient inference setup via vllm that can handle millions of documents
28
+ at scale.
29
+
30
+ ## License and use
31
+
32
+ OCRFlux is licensed under the Apache 2.0 license.
33
+ OCRFlux is intended for research and educational use.