teste-teste-teste / README.md
arthrod's picture
Update README.md
f453b83 verified
---
license: apache-2.0
title: teste-teste-teste
sdk: gradio
emoji: πŸƒ
colorFrom: red
colorTo: red
short_description: yadayadayada
sdk_version: 5.33.0
---
# Real-Time Screen Assistant - Premium Edition
This is a premium real-time screen assistant that integrates Google's Gemini 2.0 Live API with advanced screen recording capabilities.
## Features
- πŸŽ™οΈ **Real-time Audio Streaming** - Voice activity detection with noise filtering
- πŸ–₯️ **Professional Screen Recording** - Native ScreenRecorder component with webcam overlay
- πŸ€– **AI Voice Responses** - Bidirectional audio communication with Gemini 2.0
- πŸ“ **Text Response Display** - Real-time text responses with conversation history
- πŸ”„ **Background Task Management** - Proper async handling and cleanup
- πŸ“Š **Performance Monitoring** - Real-time stats and adaptive quality
## Setup
1. Set your Google AI API key:
```bash
export GEMINI_API_KEY="your-api-key-here"
```
2. Install dependencies (automatic on HuggingFace Spaces):
```bash
pip install -r requirements.txt
```
3. Run the application:
```bash
python app.py
```
## Components
- **app.py** - Main application with premium real-time integration
- **gradio_screenrecorder/** - Custom Gradio component for screen recording
- **requirements.txt** - All necessary dependencies including custom components
## Environment Variables
- `GEMINI_API_KEY` - Required: Your Google AI API key for Gemini 2.0 Live API
## Real-time Integration
This application implements complete real-time frontend integration:
1. **Continuous Audio Flow** (User β†’ Model) - Voice activity detection
2. **Model Audio Output** (Model β†’ User) - AI voice responses
3. **Screen Recording Integration** - Professional screen capture
4. **Text Response Delivery** (System β†’ User) - Real-time text display
All features are optimized for 300-second real-time sessions with adaptive quality and intelligent throttling.