Spaces:
Sleeping
Sleeping
A newer version of the Streamlit SDK is available:
1.45.1
metadata
title: FormIQ - Intelligent Document Parser
emoji: π
colorFrom: blue
colorTo: indigo
sdk: streamlit
sdk_version: 1.32.0
app_file: app.py
pinned: false
FormIQ - Intelligent Document Parser
FormIQ is an intelligent document parser that uses advanced AI models to extract and validate information from various types of documents.
Features
- Document image upload and processing
- OCR text extraction using Tesseract
- Advanced document understanding using LayoutLMv3
- Structured information extraction using Perplexity AI
- Interactive web interface built with Streamlit
Technologies Used
- Frontend: Streamlit
- OCR: Tesseract
- Document Understanding: LayoutLMv3
- Text Processing: Perplexity AI
- Data Processing: Pandas, NumPy
- Visualization: Plotly
Setup
- Clone the repository
- Install dependencies:
pip install -r requirements.txt
- Set up environment variables:
PERPLEXITY_API_KEY=your_api_key_here
Usage
- Run the Streamlit app:
streamlit run app.py
- Open your browser and navigate to the provided URL
- Upload a document image
- Click "Process Document" to extract information
Hugging Face Spaces Deployment
This project is deployed on Hugging Face Spaces. You can access the live demo at: [Your Spaces URL]
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
License
This project is licensed under the MIT License - see the LICENSE file for details.