metadata
title: Historical OCR with Contextual Intelligence
emoji: 📜
colorFrom: indigo
colorTo: purple
sdk: streamlit
sdk_version: 1.28.0
app_file: app.py
pinned: false

Historical OCR with Contextual Intelligence

An OCR application for analyzing historical documents, powered by Mistral AI.

Features

  • OCR with Context: AI-enhanced OCR optimized for historical documents
  • Document Type Detection: Automatically identifies handwritten letters, recipes, scientific texts, and more
  • Image Preprocessing: Optimizes images for better text recognition
  • Custom Prompting: Tailor the AI analysis with document-specific instructions
  • Structured Output: Returns organized, structured information based on document type
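The README does not say which preprocessing steps the app applies. As one illustration of the idea, the sketch below performs a linear contrast stretch on a grayscale image, a common way to make faded ink stand out before OCR. The function name `stretch_contrast` and the nested-list image representation are assumptions made for this example, not part of the app.

```python
def stretch_contrast(pixels):
    """Linearly rescale grayscale values so they span the full 0-255 range.

    `pixels` is a list of rows, each a list of ints in 0-255.
    Hypothetical helper for illustration; not taken from the app's code.
    """
    flat = [value for row in pixels for value in row]
    if not flat:
        return []
    lo, hi = min(flat), max(flat)
    if hi == lo:
        # A uniform image has no contrast to stretch; return a copy unchanged.
        return [row[:] for row in pixels]
    scale = 255 / (hi - lo)
    return [[round((value - lo) * scale) for value in row] for row in pixels]


# A faded scan whose values occupy only the middle of the range.
faded = [[100, 120], [140, 160]]
print(stretch_contrast(faded))  # [[0, 85], [170, 255]]
```

A real pipeline would more likely use an imaging library and add steps such as deskewing or denoising; the pure-Python version here only shows the principle.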

Using This App

  1. Upload a historical document (image or PDF)
  2. Add optional context or special instructions
  3. Get detailed, structured OCR results with historical context
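Step 2 above implies that the user's optional context is combined with a base instruction before the document is analyzed. A minimal sketch of how that could work follows. The function `build_prompt`, its parameters, and the wording of the base instruction are all hypothetical; the app's actual prompt is not shown in this README.

```python
def build_prompt(doc_type=None, instructions=None):
    """Combine a base instruction with optional, user-supplied context.

    Hypothetical helper: the names and prompt wording are illustrative only.
    """
    parts = [
        "Transcribe the text in this historical document "
        "and return structured results."
    ]
    if doc_type:
        parts.append(f"Document type: {doc_type}.")
    if instructions and instructions.strip():
        # Ignore whitespace-only input so an empty text box adds nothing.
        parts.append(f"Additional context from the user: {instructions.strip()}")
    return " ".join(parts)


print(build_prompt("handwritten letter", "Written in 1863; keep original spelling."))
```

Keeping the user's text in a clearly labeled trailing sentence makes it easy to omit when the field is left blank.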

Supported Document Types

  • Handwritten letters and correspondence
  • Historical recipes and cookbooks
  • Travel accounts and exploration logs
  • Scientific papers and experiments
  • Legal documents and certificates
  • Historical newspaper articles
  • General historical texts

Technical Details

Built with Streamlit and Mistral AI's OCR and large language model capabilities.
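As a rough sketch of what a call to Mistral's OCR endpoint can look like, the example below encodes an uploaded file as a base64 data URL and passes it to the `mistralai` Python client. The helper names `build_document_payload` and `run_ocr`, the `MISTRAL_API_KEY` environment variable, and the `mistral-ocr-latest` model name are assumptions for this example; the app's own code may be organized differently.

```python
import base64
import mimetypes
import os


def build_document_payload(filename, data):
    """Wrap raw file bytes as a base64 data URL in an OCR document payload.

    Hypothetical helper: PDFs are sent as `document_url`, everything else
    as `image_url`.
    """
    mime, _ = mimetypes.guess_type(filename)
    mime = mime or "application/octet-stream"
    encoded = base64.b64encode(data).decode("ascii")
    data_url = f"data:{mime};base64,{encoded}"
    if mime == "application/pdf":
        return {"type": "document_url", "document_url": data_url}
    return {"type": "image_url", "image_url": data_url}


def run_ocr(filename, data):
    """Send the document to Mistral OCR and return the pages as markdown.

    Requires the `mistralai` package and an API key in MISTRAL_API_KEY
    (both assumed here). Imported lazily so the payload helper above
    works without the package installed.
    """
    from mistralai import Mistral

    client = Mistral(api_key=os.environ["MISTRAL_API_KEY"])
    response = client.ocr.process(
        model="mistral-ocr-latest",  # assumed model name
        document=build_document_payload(filename, data),
    )
    return "\n\n".join(page.markdown for page in response.pages)


payload = build_document_payload("scan.png", b"abc")
print(payload["type"])  # image_url
```

In a Streamlit app, the bytes would typically come from an uploaded file object, and the returned markdown could then be passed to a language model for the structured, context-aware analysis described above.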


Created by Zach Muhlbauer, CUNY Graduate Center