Spaces:
Sleeping
Sleeping
title: FormIQ - Intelligent Document Parser | |
emoji: π | |
colorFrom: blue | |
colorTo: indigo | |
sdk: streamlit | |
sdk_version: 1.32.0 | |
app_file: app.py | |
pinned: false | |
# FormIQ - Intelligent Document Parser | |
FormIQ is an intelligent document parser that uses advanced AI models to extract and validate information from various types of documents. | |
## Features | |
- Document image upload and processing | |
- OCR text extraction using Tesseract | |
- Advanced document understanding using LayoutLMv3 | |
- Structured information extraction using Perplexity AI | |
- Interactive web interface built with Streamlit | |
## Technologies Used | |
- **Frontend**: Streamlit | |
- **OCR**: Tesseract | |
- **Document Understanding**: LayoutLMv3 | |
- **Text Processing**: Perplexity AI | |
- **Data Processing**: Pandas, NumPy | |
- **Visualization**: Plotly | |
## Setup | |
1. Clone the repository | |
2. Install dependencies: | |
```bash | |
pip install -r requirements.txt | |
``` | |
3. Set up environment variables: | |
```bash | |
PERPLEXITY_API_KEY=your_api_key_here | |
``` | |
## Usage | |
1. Run the Streamlit app: | |
```bash | |
streamlit run app.py | |
``` | |
2. Open your browser and navigate to the provided URL | |
3. Upload a document image | |
4. Click "Process Document" to extract information | |
## Hugging Face Spaces Deployment | |
This project is deployed on Hugging Face Spaces. You can access the live demo at: [Your Spaces URL] | |
## Contributing | |
Contributions are welcome! Please feel free to submit a Pull Request. | |
## License | |
This project is licensed under the MIT License - see the LICENSE file for details. | |