Spaces: polygraf-ai/business_card_extractor

rongo1 committed on Jul 15

Commit c609854

1 Parent(s): dae9b98

fix

Files changed (5)

.gitignore +7 -1
README.md +69 -10
app.py +1 -1
env.example +8 -1
google.py → google_funcs.py +20 -0

.gitignore CHANGED

@@ -11,6 +11,10 @@ venv/
 # Environment variables
 .env
 # IDE
 .vscode/
 .idea/
@@ -39,4 +43,6 @@ Thumbs.db
 # Logs
 *.log
-business_card_extractor.log

 # Environment variables
 .env
+# Google Drive authentication files
+token.pickle
+google_token_base64.txt
 # IDE
 .vscode/
 .idea/
 # Logs
 *.log
+business_card_extractor.log
+convert_token_to_base64.py

README.md CHANGED

@@ -11,25 +11,26 @@ pinned: false
 # Business Card Data Extractor 💼
-An AI-powered tool that extracts structured data from business card images using Google's Gemini AI. Upload business card images and get organized data exported to Excel files.
 ## Features
 - **Batch Processing**: Process multiple business cards at once (up to 5 per batch)
 - **AI Model Selection**: Choose between Gemini 2.5 Flash (fast) or Gemini 2.5 Pro (accuracy)
 - **Excel Export**: Get data in two formats:
   - Current session results
   - Cumulative database (appends across sessions)
 - **Smart Data Extraction**: Extracts name, company, title, emails, phones, address, website
-- **Image Storage**: Option to save uploaded images with timestamps
 ## How to Use
-1. **Set API Key**: Add your Google Gemini API key as `Gemini_API` environment variable
 2. **Upload Images**: Select up to 5 business card images
 3. **Choose Model**: Select Gemini model (Flash for speed, Pro for accuracy)
 4. **Process**: Click "Extract Business Card Data"
-5. **Download**: Get Excel files with extracted data
 ## Supported Data Fields
@@ -42,12 +43,70 @@ An AI-powered tool that extracts structured data from business card images using
 - **Website**: Company website URL
 - **Processing Info**: Timestamp, model used, filename
-## Requirements
-- Google Gemini API key
-- Image formats: JPG, JPEG, PNG, WEBP
-- Maximum file size: 10MB per image
-## API Usage
-This app uses Google's Gemini AI for intelligent text extraction from business card images. Batch processing reduces API costs by processing multiple cards in a single request.

 # Business Card Data Extractor 💼
+An AI-powered tool that extracts structured data from business card images using Google's Gemini AI. Upload business card images and get organized data exported to Excel files with automatic Google Drive storage.
 ## Features
 - **Batch Processing**: Process multiple business cards at once (up to 5 per batch)
 - **AI Model Selection**: Choose between Gemini 2.5 Flash (fast) or Gemini 2.5 Pro (accuracy)
+- **Google Drive Storage**: Automatic upload to organized Drive folders
 - **Excel Export**: Get data in two formats:
   - Current session results
   - Cumulative database (appends across sessions)
 - **Smart Data Extraction**: Extracts name, company, title, emails, phones, address, website
+- **Direct Links**: Access files directly through Google Drive URLs
 ## How to Use
+1. **Setup**: Complete the setup process below (one-time)
 2. **Upload Images**: Select up to 5 business card images
 3. **Choose Model**: Select Gemini model (Flash for speed, Pro for accuracy)
 4. **Process**: Click "Extract Business Card Data"
+5. **Access Files**: Download temporary copies or access permanent files via Google Drive links
 ## Supported Data Fields
 - **Website**: Company website URL
 - **Processing Info**: Timestamp, model used, filename
+## Setup Instructions
+### 1. Google Gemini API
+- Get your API key from: https://aistudio.google.com/
+- Set as environment variable: `Gemini_API`
+### 2. Google Drive API Setup
+1. **Create Google Cloud Project**:
+   - Go to https://console.cloud.google.com/
+   - Create a new project or select an existing one
+2. **Enable Google Drive API**:
+   - In the Google Cloud Console, go to "APIs & Services" > "Library"
+   - Search for "Google Drive API" and enable it
+3. **Create OAuth 2.0 Credentials**:
+   - Go to "APIs & Services" > "Credentials"
+   - Click "+ CREATE CREDENTIALS" > "OAuth client ID"
+   - Select "Desktop application"
+   - Download the JSON file
+   - Extract `client_id` and `client_secret` from the JSON
+4. **Set Environment Variables**:
+   ```bash
+   GOOGLE_CLIENT_ID=your_client_id_here
+   GOOGLE_CLIENT_SECRET=your_client_secret_here
+   ```
+### 3. Local Development Setup
+1. **Install Dependencies**:
+   ```bash
+   pip install -r requirements.txt
+   ```
+2. **Run Locally First**:
+   ```bash
+   python app.py
+   ```
+   - Complete the OAuth flow in your browser
+   - This creates `token.pickle` file
+### 4. Deployment Setup (Hugging Face Spaces, etc.)
+1. **Generate Token for Deployment**:
+   ```bash
+   python convert_token_to_base64.py
+   ```
+   - This converts `token.pickle` to a base64 string
+2. **Set Environment Variables** in your deployment platform:
+   ```bash
+   Gemini_API=your_gemini_api_key
+   GOOGLE_CLIENT_ID=your_google_client_id
+   GOOGLE_CLIENT_SECRET=your_google_client_secret
+   GOOGLE_TOKEN_BASE64=your_base64_encoded_token
+   ```
+## Google Drive Folders
+- **📁 Exports**: https://drive.google.com/drive/folders/1k5iP4egzLrGJwnHkMhxt9bAkaCiieojO
+- **🖼️ Images**: https://drive.google.com/drive/folders/1gd280IqcAzpAFTPeYsZjoBUOU9S7Zx3c
+## Technical Details
+- **Image Formats**: JPG, JPEG, PNG, WEBP, BMP
+- **Maximum File Size**: 10MB per image
+- **Batch Processing**: Up to 5 cards per API call
+- **Storage**: Automatic upload to Google Drive
+- **Models**: Gemini 2.5 Flash (fast) / Pro (accurate)
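
The README refers to `convert_token_to_base64.py`, but this commit adds that script to `.gitignore`, so its contents do not appear in the diff. A minimal sketch of what such a helper might look like follows. It assumes the script reads `token.pickle` and writes the encoded string to `google_token_base64.txt` (both filenames come from the `.gitignore` changes); the function name `encode_token_file` is hypothetical.

```python
import base64
from pathlib import Path


def encode_token_file(token_path: str = "token.pickle",
                      output_path: str = "google_token_base64.txt") -> str:
    """Return the base64 text form of token_path and also write it to output_path."""
    raw = Path(token_path).read_bytes()
    # b64encode returns bytes; decode to ASCII so the value can be pasted
    # into an environment variable such as GOOGLE_TOKEN_BASE64.
    encoded = base64.b64encode(raw).decode("ascii")
    Path(output_path).write_text(encoded, encoding="ascii")
    return encoded
```

Calling `encode_token_file()` from the project directory after the local OAuth flow has produced `token.pickle` would yield a single-line string suitable for the `GOOGLE_TOKEN_BASE64` setting.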

app.py CHANGED

@@ -13,7 +13,7 @@ import sys
 import tempfile
 # Import Google Drive functionality
-from google import get_drive_service, upload_excel_to_exports_folder, upload_image_to_images_folder, list_files_in_folder
 # Configure logging
 logging.basicConfig(

 import tempfile
 # Import Google Drive functionality
+from google_funcs import get_drive_service, upload_excel_to_exports_folder, upload_image_to_images_folder, list_files_in_folder
 # Configure logging
 logging.basicConfig(
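
The commit message says only "fix", so the reason for the rename is not stated. One plausible motivation (an assumption) is name shadowing: a local file called `google.py` sits ahead of installed packages on `sys.path`, and the module itself imports `google.auth.transport.requests`. The sketch below reproduces that general effect with a made-up package name, `vendorlib`, so it does not depend on any Google library being installed; all names in it are hypothetical.

```python
import os
import subprocess
import sys
import tempfile
from pathlib import Path


def run_shadow_demo() -> str:
    """Run a script whose sibling file shadows an installed package; return stderr."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)

        # Stand-in for an installed library: a package with a submodule.
        package_dir = root / "site" / "vendorlib"
        package_dir.mkdir(parents=True)
        (package_dir / "__init__.py").write_text("")
        (package_dir / "auth.py").write_text("NAME = 'vendorlib.auth'\n")

        # Stand-in for the project: a local file with the same top-level name.
        project_dir = root / "project"
        project_dir.mkdir()
        (project_dir / "vendorlib.py").write_text("def helper():\n    return 1\n")
        (project_dir / "main.py").write_text("import vendorlib.auth\n")

        env = dict(os.environ)
        env["PYTHONPATH"] = str(root / "site")
        env.pop("PYTHONSAFEPATH", None)  # keep the script directory first on sys.path

        result = subprocess.run(
            [sys.executable, "main.py"],
            cwd=project_dir, env=env, capture_output=True, text=True,
        )
        return result.stderr
```

The returned traceback ends in a `ModuleNotFoundError` stating that `vendorlib` is not a package, because the local `vendorlib.py` is found before the real package. Renaming the local file, as this commit does with `google_funcs.py`, removes the collision.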

env.example CHANGED

@@ -15,7 +15,14 @@ Gemini_API=your_gemini_api_key_here
 GOOGLE_CLIENT_ID=your_google_client_id_here
 GOOGLE_CLIENT_SECRET=your_google_client_secret_here
 # Examples:
 # Gemini_API=AIzaSyBxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
 # GOOGLE_CLIENT_ID=1234567890-abcdefghijklmnopqrstuvwxyz.apps.googleusercontent.com
-# GOOGLE_CLIENT_SECRET=GOCSPX-xxxxxxxxxxxxxxxxxxxxxxxx

 GOOGLE_CLIENT_ID=your_google_client_id_here
 GOOGLE_CLIENT_SECRET=your_google_client_secret_here
+# Google Drive Token (Required for deployment environments)
+# Generate this by running the app locally first, then use convert_token_to_base64.py
+# For local development: Leave this empty (token.pickle will be created automatically)
+# For deployment: Set this to the base64 encoded token string
+GOOGLE_TOKEN_BASE64=your_base64_encoded_token_here
 # Examples:
 # Gemini_API=AIzaSyBxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
 # GOOGLE_CLIENT_ID=1234567890-abcdefghijklmnopqrstuvwxyz.apps.googleusercontent.com
+# GOOGLE_CLIENT_SECRET=GOCSPX-xxxxxxxxxxxxxxxxxxxxxxxx
+# GOOGLE_TOKEN_BASE64=gASVxwAAAAAAAAB9cQAoWBYAAABhY2Nlc3NfdG9rZW4... (very long string)
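
The comments in `env.example` imply two configurations: local development needs the three key settings, and deployment additionally needs `GOOGLE_TOKEN_BASE64`. A startup check along the following lines could report which settings are absent. This is a sketch, not code from the repository: the function names are hypothetical, and treating the `your_..._here` placeholders as unset is an assumption based on the placeholder style used in the file.

```python
from typing import List, Mapping

REQUIRED_ALWAYS = ("Gemini_API", "GOOGLE_CLIENT_ID", "GOOGLE_CLIENT_SECRET")
REQUIRED_WHEN_DEPLOYED = ("GOOGLE_TOKEN_BASE64",)


def is_unset(value: str) -> bool:
    """Treat empty values and untouched 'your_..._here' placeholders as unset."""
    stripped = value.strip()
    return not stripped or (stripped.startswith("your_") and stripped.endswith("_here"))


def missing_settings(env: Mapping[str, str], deployed: bool) -> List[str]:
    """Return the names of required settings that are unset, in declaration order."""
    names = REQUIRED_ALWAYS + (REQUIRED_WHEN_DEPLOYED if deployed else ())
    return [name for name in names if is_unset(env.get(name, ""))]
```

Passing `os.environ` as `env` checks the live process environment; passing a plain dictionary keeps the check easy to test.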

google.py → google_funcs.py RENAMED

@@ -1,5 +1,6 @@
 import os
 import pickle
 from google.auth.transport.requests import Request
 from google_auth_oauthlib.flow import InstalledAppFlow
 from googleapiclient.discovery import build
@@ -26,6 +27,22 @@ TOKEN_PICKLE_FILE = 'token.pickle'
 def get_drive_service():
     """Authenticates with Google and returns a Drive service object."""
     creds = None
     # The file token.pickle stores the user's access and refresh tokens.
     if os.path.exists(TOKEN_PICKLE_FILE):
         with open(TOKEN_PICKLE_FILE, 'rb') as token:
@@ -34,11 +51,13 @@ def get_drive_service():
     # If there are no (valid) credentials available, let the user log in.
     if not creds or not creds.valid:
         if creds and creds.expired and creds.refresh_token:
             creds.refresh(Request())
         else:
             if not CLIENT_ID or not CLIENT_SECRET:
                 raise ValueError("GOOGLE_CLIENT_ID and GOOGLE_CLIENT_SECRET environment variables are required")
             # Use client_config dictionary instead of a client_secret.json file
             client_config = {
                 "installed": {
@@ -55,6 +74,7 @@ def get_drive_service():
         # Save the credentials for the next run
         with open(TOKEN_PICKLE_FILE, 'wb') as token:
             pickle.dump(creds, token)
     return build('drive', 'v3', credentials=creds)

 import os
 import pickle
+import base64
 from google.auth.transport.requests import Request
 from google_auth_oauthlib.flow import InstalledAppFlow
 from googleapiclient.discovery import build
 def get_drive_service():
     """Authenticates with Google and returns a Drive service object."""
     creds = None
+    # --- NEW CODE FOR DEPLOYMENT ENVIRONMENTS ---
+    # If token file doesn't exist, try to create it from environment variable
+    if not os.path.exists(TOKEN_PICKLE_FILE):
+        encoded_token = os.environ.get('GOOGLE_TOKEN_BASE64')
+        if encoded_token:
+            logger.info("Found token in environment variable. Recreating token.pickle file.")
+            try:
+                decoded_token = base64.b64decode(encoded_token)
+                with open(TOKEN_PICKLE_FILE, "wb") as token_file:
+                    token_file.write(decoded_token)
+                logger.info("Successfully recreated token.pickle from environment variable")
+            except Exception as e:
+                logger.error(f"Failed to decode token from environment variable: {e}")
+    # --- END OF NEW CODE ---
     # The file token.pickle stores the user's access and refresh tokens.
     if os.path.exists(TOKEN_PICKLE_FILE):
         with open(TOKEN_PICKLE_FILE, 'rb') as token:
     # If there are no (valid) credentials available, let the user log in.
     if not creds or not creds.valid:
         if creds and creds.expired and creds.refresh_token:
+            logger.info("Refreshing expired credentials")
             creds.refresh(Request())
         else:
             if not CLIENT_ID or not CLIENT_SECRET:
                 raise ValueError("GOOGLE_CLIENT_ID and GOOGLE_CLIENT_SECRET environment variables are required")
+            logger.info("Starting OAuth flow for new credentials")
             # Use client_config dictionary instead of a client_secret.json file
             client_config = {
                 "installed": {
         # Save the credentials for the next run
         with open(TOKEN_PICKLE_FILE, 'wb') as token:
             pickle.dump(creds, token)
+            logger.info("Saved new credentials to token.pickle")
     return build('drive', 'v3', credentials=creds)
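
The restore step added to `get_drive_service()` can be isolated into a small function for testing. The sketch below follows the same order as the diff (skip if `token.pickle` exists, otherwise decode `GOOGLE_TOKEN_BASE64` and write the file). It differs in two labeled ways: the function name `restore_token_file` is hypothetical, and it passes `validate=True` to `base64.b64decode`, which the committed code does not do.

```python
import base64
import binascii
import os
from typing import Mapping


def restore_token_file(env: Mapping[str, str], token_path: str = "token.pickle") -> bool:
    """Write token_path from GOOGLE_TOKEN_BASE64 if the file is missing.

    Returns True only when a new file was written.
    """
    if os.path.exists(token_path):
        return False  # an existing local token takes precedence, as in the diff
    encoded = env.get("GOOGLE_TOKEN_BASE64", "").strip()
    if not encoded:
        return False
    try:
        # validate=True rejects characters outside the base64 alphabet instead of
        # silently discarding them, so a mangled value fails here.
        decoded = base64.b64decode(encoded, validate=True)
    except (binascii.Error, ValueError):
        return False
    with open(token_path, "wb") as token_file:
        token_file.write(decoded)
    return True
```

Without `validate=True`, `b64decode` drops non-alphabet characters before decoding, so a value damaged by copy and paste may decode to bytes that only fail later, when the file is unpickled.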