File size: 656 Bytes
7cad21b
 
 
 
 
f0c7105
7cad21b
 
 
 
 
a6d1230
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
---
title: Olmocr Demo
emoji: 😻
colorFrom: red
colorTo: red
sdk: docker
sdk_version: 5.29.0
app_file: app.py
pinned: false
---

# PDF to EPUB Converter (olmOCR)

This Gradio app converts a PDF into a clean EPUB using the [olmOCR](https://huggingface.co/allenai/olmOCR-7B-0225-preview) model. Each PDF page is rendered and processed through OCR, with the first page used as the cover. Metadata (title, author, language) can be entered manually.

## Features

- OCR via `olmOCR-7B-0225-preview`
- First page used as EPUB cover
- Input for title, author, and language
- EPUB output for ebook readers

## Requirements

Already defined in `requirements.txt`: