Spaces:
Running
Running
File size: 512 Bytes
42dc069 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 |
# OCR Preprocessing Triage
## Quick Fixes Implemented
1. **Handwritten** - Disabled thresholding, uses grayscale only
2. **Newspapers** - Increased block size (51) and constant (10) for softer thresholding
3. **JPEG Artifacts** - Auto-detection and specialized denoising
4. **Border Issues** - Crops edges after deskew to avoid threshold problems
5. **Low Resolution** - Upscales small text for better recognition
## Testing
```
python testing/test_triage_fix.py
```
Check `output/comparison/` for results.
|