Spaces:
Sleeping
Sleeping
A newer version of the Gradio SDK is available:
5.42.0
metadata
title: BrailleMenuGen
emoji: 🦀
colorFrom: green
colorTo: green
sdk: gradio
sdk_version: 5.29.0
app_file: app.py
pinned: false
Menu to Braille Converter
An AI-powered application that converts food menu images to Braille text for visually impaired users.
Features
- Upload menu images
- Extract text using AI-powered document understanding (LayoutLMv2)
- Process and structure menu text using LLMs
- Convert text to Braille
- Display Braille in multiple formats (text, visual, side-by-side)
- Download as PDF in different formats
Deployment on Hugging Face Spaces
Option 1: Direct GitHub Repository Deployment
- Fork this repository to your GitHub account
- Go to Hugging Face Spaces
- Click "Create new Space"
- Choose "Streamlit" as the SDK
- Connect your GitHub account and select this repository
- Choose hardware requirements (recommend at least GPU for better performance)
- Click "Create Space"
Option 2: Manual Deployment
- Go to Hugging Face Spaces
- Click "Create new Space"
- Choose "Streamlit" as the SDK
- Give your Space a name
- Choose hardware requirements (recommend at least GPU for better performance)
- Click "Create Space"
- Clone the Space repository locally
- Copy all files from this project to the cloned repository
- Push the changes to the Space repository
Hardware Requirements
- Minimum: CPU (2 vCPUs, 16 GB RAM)
- Recommended: GPU (T4 or better)
Models Used
- Document AI: microsoft/layoutlmv2-base-uncased
- Text Processing: meta-llama/Meta-Llama-3-8B-Instruct (with fallback to mistralai/Mistral-7B-Instruct-v0.2)
- Context Enhancement: facebook/bart-large-cnn
Local Development
- Clone this repository
- Install dependencies:
pip install -r requirements.txt
- Run the application:
streamlit run app.py
Future Enhancements
- Improved menu section recognition
- Support for multiple languages
- Physical Braille printer integration
- Mobile app version