ChatDOC
/

OCRFlux-3B

Image-Text-to-Text

text-generation-inference

Model card Files Files and versions Community

OCRFlux-3B / README.md

ChatDOC's picture

Update README.md

333e5e6 verified 9 days ago

|

history blame contribute delete

926 Bytes

	---
	language:
	- en
	license: apache-2.0
	benchmarks:
	- ChatDoc/OCRFlux-bench-single
	- ChatDoc/OCRFlux-bench-cross
	- ChatDoc/OCRFlux-pubtabnet-single
	- ChatDoc/OCRFlux-pubtabnet-cross
	base_model:
	- Qwen/Qwen2.5-VL-3B-Instruct
	library_name: transformers
	---


	# OCRFlux-3B

	This is a preview release of the OCRFlux-3B model that's fine tuned from Qwen2.5-VL-3B-Instruct using the our private document datasets and some data from
	[olmOCR-mix-0225](https://huggingface.co/datasets/allenai/olmOCR-mix-0225) dataset.

	Quick links:
	- 🛠️ [Code](https://github.com/chatdoc-com/OCRFlux)

	## Usage

	The best way to use this model is via the [OCRFlux toolkit](https://github.com/chatdoc-com/OCRFlux).
	The toolkit comes with an efficient inference setup via vllm that can handle millions of documents
	at scale.

	## License and use

	OCRFlux is licensed under the Apache 2.0 license.
	OCRFlux is intended for research and educational use.