ChatDOC/OCRFlux-3B

ChatDOC committed 10 days ago

Commit 7a9d91b · verified · 1 parent: 5441386

Update README.md

Files changed (1)

README.md +33 -34

@@ -1,34 +1,33 @@
----
-language:
-- en
-license: apache-2.0
-benchmarks:
-- ChatDoc/OCRFlux-bench-single
-- ChatDoc/OCRFlux-bench-cross
-- ChatDoc/OCRFlux-pubtabnet-single
-- ChatDoc/OCRFlux-pubtabnet-cross
-base_model:
-- Qwen/Qwen2.5-VL-3B-Instruct
-library_name: transformers
----
-<img alt="OCRFlux Logo" src="https://github.com/chatdoc-com/OCRFlux/blob/main/images/OCRFlux.png" width="242px" style="margin-left:'auto' margin-right:'auto' display:'block'">
-# OCRFlux
-This is a preview release of the OCRFlux model that's fine tuned from Qwen2.5-VL-3B-Instruct using the our private document datasets and some data from
-[olmOCR-mix-0225](https://huggingface.co/datasets/allenai/olmOCR-mix-0225) dataset.
-Quick links:
-- 🛠️ [Code](https://github.com/chatdoc-com/OCRFlux)
-## Usage
-The best way to use this model is via the [OCRFlux toolkit](https://github.com/chatdoc-com/OCRFlux).
-The toolkit comes with an efficient inference setup via vllm that can handle millions of documents
-at scale.
-## License and use
-OCRFlux is licensed under the Apache 2.0 license.
-OCRFlux is intended for research and educational use.

+---
+language:
+- en
+license: apache-2.0
+benchmarks:
+- ChatDoc/OCRFlux-bench-single
+- ChatDoc/OCRFlux-bench-cross
+- ChatDoc/OCRFlux-pubtabnet-single
+- ChatDoc/OCRFlux-pubtabnet-cross
+base_model:
+- Qwen/Qwen2.5-VL-3B-Instruct
+library_name: transformers
+---
+# OCRFlux
+This is a preview release of the OCRFlux model that's fine tuned from Qwen2.5-VL-3B-Instruct using the our private document datasets and some data from
+[olmOCR-mix-0225](https://huggingface.co/datasets/allenai/olmOCR-mix-0225) dataset.
+Quick links:
+- 🛠️ [Code](https://github.com/chatdoc-com/OCRFlux)
+## Usage
+The best way to use this model is via the [OCRFlux toolkit](https://github.com/chatdoc-com/OCRFlux).
+The toolkit comes with an efficient inference setup via vllm that can handle millions of documents
+at scale.
+## License and use
+OCRFlux is licensed under the Apache 2.0 license.
+OCRFlux is intended for research and educational use.
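
Every line of README.md is marked as changed in the hunk above, yet the visible text on the removed and added sides matches except for the `<img>` logo line, which the new version drops. The sketch below checks this with Python's `difflib` on a few representative lines. The trailing `\r` on the old lines is an assumption, used only to illustrate how an invisible difference such as line endings could make every line register as changed; the page does not show what the actual whitespace difference was. `content_changes` is a hypothetical helper, not part of the OCRFlux toolkit.

```python
import difflib

# A few representative README lines with the "-"/"+" diff markers stripped.
# The <img> tag is abbreviated (its style attribute is omitted for brevity).
new_readme = [
    "---",
    "library_name: transformers",
    "---",
    "# OCRFlux",
    "## Usage",
]

# ASSUMPTION: the old file is modelled with a trailing "\r" on each line to
# stand in for an invisible whitespace difference. The real cause is unknown.
old_readme = [
    "---\r",
    "library_name: transformers\r",
    "---\r",
    '<img alt="OCRFlux Logo" src="https://github.com/chatdoc-com/OCRFlux/blob/main/images/OCRFlux.png" width="242px">\r',
    "# OCRFlux\r",
    "## Usage\r",
]


def content_changes(old, new):
    """Return the ndiff lines that still differ once trailing whitespace is ignored."""
    old = [line.rstrip() for line in old]
    new = [line.rstrip() for line in new]
    # ndiff prefixes removed lines with "- " and added lines with "+ ".
    return [d for d in difflib.ndiff(old, new) if d.startswith(("- ", "+ "))]


for change in content_changes(old_readme, new_readme):
    print(change)
```

Under these assumptions the only line reported is the removed `<img>` tag, which is consistent with the +33 -34 count shown for README.md.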