Spaces:

polygraf-ai
/

business_card_extractor

Running

App Files Files Community

rongo1 commited on Jul 15

Commit

595f63f

1 Parent(s): 29d41e1

fix: FIXED README

Browse files

Files changed (1) hide show

README.md +41 -106

README.md CHANGED Viewed

@@ -1,118 +1,53 @@
-# Business Card Data Extractor
-A Gradio-based application that extracts contact information from business card images using Google's Gemini API and exports the data to Excel.
-## Features
-- **Efficient Batch Processing**: Upload multiple cards, processed 5 at a time per API call
-- **Model Selection**: Choose between Gemini 2.5 Flash (fast) or Pro (accurate)
-- **Image Storage**: Optionally save business card images with timestamped filenames
-- **AI-Powered Extraction**: Uses Gemini AI to extract:
-  - Names (full name, first name, last name)
-  - Job titles and departments
-  - Company information
-  - Email addresses (multiple supported)
-  - Phone numbers (multiple supported)
-  - Addresses
-  - Websites and social media links
-  - Additional information
-- **Excel Export**: Automatically creates formatted Excel files
-- **Data Consolidation**: Multiple emails/phones are combined with commas in a single cell
-## Installation
-1. Clone this repository
-2. Install dependencies:
-   ```bash
-   pip install -r requirements.txt
-   ```
-## Usage
-1. Run the application:
-   ```bash
-   python app.py
-   ```
-2. Open your browser to the provided URL (typically http://localhost:7860)
-3. Upload one or more business card images
-4. Click "Process Business Cards"
-5. Download the generated Excel file
-## Output Format
-**Two Excel files are generated:**
-1. **Current Run File**: Contains only the cards from the current session
-2. **Total Database File**: Contains ALL cards ever processed (cumulative)
-Each business card creates one row in the Excel file with columns for:
-- filename
-- processed_date
-- method (AI model used: gemini-2.5-flash or gemini-2.5-pro)
-- saved_image_path (path to saved image file, if image saving is enabled)
-- full_name, first_name, last_name
-- job_title, company, department
-- emails (comma-separated if multiple)
-- phones (all types combined, comma-separated if multiple)
-- address (street and full address combined)
-- city, state, postal_code, country
-- website, linkedin
-- And more...
-## Configuration
-### Environment Variables
-Set the following environment variable:
-- `Gemini_API`: Your Google Gemini API key
-#### For Hugging Face Spaces:
-1. Go to your Space settings
-2. Add a new secret named `Gemini_API`
-3. Set the value to your Google Gemini API key
-#### For Local Development:
-```bash
-export Gemini_API="your_api_key_here"
-```
-Or create a `.env` file:
-```bash
-# Copy the example file
-cp env.example .env
-# Then edit .env with your actual API key
-```
-## Logging
-The application includes comprehensive logging:
-- **Log File**: `business_card_extractor.log` (created automatically)
-- **Console Output**: Real-time logging to terminal
-- **Log Levels**: INFO for general progress, DEBUG for detailed operations
-- **Coverage**: Every processing step, API calls, file operations, and errors
-Logs help with:
-- Debugging extraction issues
-- Monitoring API usage
-- Tracking processing performance
-- Identifying problematic business cards
-## File Structure
-```
-Business_Cards_analyzer/
-├── app.py                 # Main Gradio application
-├── requirements.txt       # Python dependencies
-├── env.example           # Environment variables template
-├── setup_hf_space.md     # Hugging Face deployment guide
-├── prompts/              # AI prompts for data extraction
-│   ├── prompt.txt
-│   └── system_prompt.txt
-├── business_card_exports/ # Output Excel files
-└── business_cards/       # Saved business card images (optional)
-    └── .gitkeep          # Ensures directory exists
-```

+---
+title: Business Card Data Extractor
+emoji: 💼
+colorFrom: blue
+colorTo: purple
+sdk: gradio
+sdk_version: "4.44.0"
+app_file: app.py
+pinned: false
+---
+# Business Card Data Extractor 💼
+An AI-powered tool that extracts structured data from business card images using Google's Gemini AI. Upload business card images and get organized data exported to Excel files.
+## Features
+- **Batch Processing**: Process multiple business cards at once (up to 5 per batch)
+- **AI Model Selection**: Choose between Gemini 2.5 Flash (fast) or Gemini 2.5 Pro (accuracy)
+- **Excel Export**: Get data in two formats:
+  - Current session results
+  - Cumulative database (appends across sessions)
+- **Smart Data Extraction**: Extracts name, company, title, emails, phones, address, website
+- **Image Storage**: Option to save uploaded images with timestamps
+## How to Use
+1. **Set API Key**: Add your Google Gemini API key as `Gemini_API` environment variable
+2. **Upload Images**: Select up to 5 business card images
+3. **Choose Model**: Select Gemini model (Flash for speed, Pro for accuracy)
+4. **Process**: Click "Extract Business Card Data"
+5. **Download**: Get Excel files with extracted data
+## Supported Data Fields
+- **Name**: Full name from business card
+- **Company**: Company/organization name
+- **Title**: Job title/position
+- **Emails**: Email addresses (comma-separated if multiple)
+- **Phones**: Phone numbers (comma-separated if multiple)
+- **Address**: Full address information
+- **Website**: Company website URL
+- **Processing Info**: Timestamp, model used, filename
+## Requirements
+- Google Gemini API key
+- Image formats: JPG, JPEG, PNG, WEBP
+- Maximum file size: 10MB per image
+## API Usage
+This app uses Google's Gemini AI for intelligent text extraction from business card images. Batch processing reduces API costs by processing multiple cards in a single request.