File size: 512 Bytes
42dc069
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
# OCR Preprocessing Triage

## Quick Fixes Implemented

1. **Handwritten** - Disabled thresholding, uses grayscale only
2. **Newspapers** - Increased block size (51) and constant (10) for softer thresholding
3. **JPEG Artifacts** - Auto-detection and specialized denoising
4. **Border Issues** - Crops edges after deskew to avoid threshold problems
5. **Low Resolution** - Upscales small text for better recognition

## Testing

```
python testing/test_triage_fix.py
```

Check `output/comparison/` for results.