---
title: Olmocr Demo
emoji: 😻
colorFrom: red
colorTo: red
sdk: docker
sdk_version: 5.29.0
app_file: app.py
pinned: false
---
# PDF to EPUB Converter (olmOCR)
This Gradio app converts a PDF into an EPUB using the [olmOCR](https://huggingface.co/allenai/olmOCR-7B-0225-preview) model. Each PDF page is rendered to an image and run through OCR, and the first page also serves as the EPUB cover. Title, author, and language metadata are entered manually.
## Features
- OCR via `olmOCR-7B-0225-preview`
- First page used as EPUB cover
- Input for title, author, and language
- EPUB output for ebook readers
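
The packaging step behind these features can be sketched with the Python standard library alone. This is an illustration, not the code in this Space's `app.py`. It assumes the OCR text for each page is already available as a string and that the rendered first page is available as PNG bytes. The function name `build_epub` and its parameters are hypothetical, and the output has not been validated with an EPUB checker.

```python
import io
import uuid
import zipfile
from xml.sax.saxutils import escape

XHTML = (
    '<?xml version="1.0" encoding="UTF-8"?>\n'
    '<html xmlns="http://www.w3.org/1999/xhtml" '
    'xmlns:epub="http://www.idpf.org/2007/ops">'
    '<head><title>{title}</title></head><body>{body}</body></html>'
)

CONTAINER = (
    '<?xml version="1.0" encoding="UTF-8"?>\n'
    '<container version="1.0" '
    'xmlns="urn:oasis:names:tc:opendocument:xmlns:container">'
    '<rootfiles><rootfile full-path="OEBPS/content.opf" '
    'media-type="application/oebps-package+xml"/></rootfiles></container>'
)


def build_epub(page_texts, cover_png, title, author, language="en"):
    """Hypothetical helper: pack OCR text and a cover image into EPUB bytes."""
    if not page_texts:
        raise ValueError("need at least one page of text")

    names = [f"page{i}.xhtml" for i in range(1, len(page_texts) + 1)]

    # Manifest lists every file; the spine gives the reading order.
    manifest = [
        '<item id="cover" href="cover.png" media-type="image/png" '
        'properties="cover-image"/>',
        '<item id="nav" href="nav.xhtml" '
        'media-type="application/xhtml+xml" properties="nav"/>',
    ]
    manifest += [
        f'<item id="p{i}" href="{name}" media-type="application/xhtml+xml"/>'
        for i, name in enumerate(names, 1)
    ]
    spine = [f'<itemref idref="p{i}"/>' for i in range(1, len(names) + 1)]

    # User-supplied metadata is escaped before it goes into the XML.
    opf = (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        '<package xmlns="http://www.idpf.org/2007/opf" version="3.0" '
        'unique-identifier="uid">'
        '<metadata xmlns:dc="http://purl.org/dc/elements/1.1/">'
        f'<dc:identifier id="uid">urn:uuid:{uuid.uuid4()}</dc:identifier>'
        f'<dc:title>{escape(title)}</dc:title>'
        f'<dc:creator>{escape(author)}</dc:creator>'
        f'<dc:language>{escape(language)}</dc:language>'
        '</metadata>'
        f'<manifest>{"".join(manifest)}</manifest>'
        f'<spine>{"".join(spine)}</spine>'
        '</package>'
    )

    toc_items = "".join(
        f'<li><a href="{name}">Page {i}</a></li>'
        for i, name in enumerate(names, 1)
    )
    nav = XHTML.format(
        title=escape(title),
        body=f'<nav epub:type="toc"><ol>{toc_items}</ol></nav>',
    )

    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        # The mimetype entry must be first and stored without compression.
        zf.writestr("mimetype", "application/epub+zip",
                    compress_type=zipfile.ZIP_STORED)
        deflate = zipfile.ZIP_DEFLATED
        zf.writestr("META-INF/container.xml", CONTAINER, compress_type=deflate)
        zf.writestr("OEBPS/content.opf", opf, compress_type=deflate)
        zf.writestr("OEBPS/nav.xhtml", nav, compress_type=deflate)
        zf.writestr("OEBPS/cover.png", cover_png, compress_type=deflate)
        for i, (name, text) in enumerate(zip(names, page_texts), 1):
            # One paragraph per non-empty line of OCR text.
            body = "".join(
                f"<p>{escape(line)}</p>"
                for line in text.splitlines() if line.strip()
            )
            page = XHTML.format(title=f"Page {i}", body=body)
            zf.writestr(f"OEBPS/{name}", page, compress_type=deflate)
    return buf.getvalue()
```

Calling `build_epub(["First page text"], png_bytes, "My Book", "A. Writer", "en")` returns bytes that can be written to a `.epub` file. Building the archive in memory keeps the sketch independent of how the app stores its temporary files.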
## Requirements
Already defined in `requirements.txt`: