Spaces: tommytracx / FluentQ

tommytracx committed on Apr 10, 2025

Commit 5ace8a9 · verified · 1 parent: 588b9b6

Update README.md

Files changed (1)

README.md +56 -6

@@ -1,9 +1,59 @@
 # AGI Telecom POC
-This is a full stack voice interface system powered by LLM, STT, TTS, and WebRTC-ready frontend.
-## Quick Start
-```bash
-pip install -r requirements.txt
-uvicorn app.main:app --reload
-```

+This Hugging Face Space demonstrates an AGI-powered telecom interface that enables voice and text interaction through telecommunication channels (WebRTC/SIP).
+## Overview
+This proof-of-concept showcases how AI assistants can be delivered through telecom infrastructure with:
+- Multimodal communication (voice + text)
+- Agentic intelligence (reasoning, memory)
+- Telecom-enabled delivery
+## Demo Usage
+This Space provides two ways to interact with the system:
+1. **Gradio Interface**: A simplified interface that demonstrates core functionality
+   - Upload audio or use text input
+   - Get transcriptions, agent responses, and speech synthesis
+   - Manage conversation sessions
+2. **API Endpoints**: Direct API access for more advanced integration
+   - `/api/transcribe` - Convert audio to text
+   - `/api/query` - Process text with the agent
+   - `/api/speak` - Convert text to speech
+   - `/api/session` - Create new conversation sessions
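
The endpoints listed above could be called from a small client along the following lines. This is a minimal sketch, not the Space's documented API: the README names only the paths, so the base URL (taken from the Local Development port), the use of `POST`, the JSON request bodies, and field names such as `text` and `session_id` are all assumptions to check against the app's route handlers.

```python
import json
import urllib.request

# Assumption: the app is served locally on port 8000, as in "Local Development".
BASE_URL = "http://localhost:8000"


def build_json_request(path: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for one of the endpoints."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url=BASE_URL + path,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",  # assumed; the README does not state the HTTP method
    )


def post_json(path: str, payload: dict) -> dict:
    """Send the request and decode the response, assuming it is JSON."""
    with urllib.request.urlopen(build_json_request(path, payload)) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example usage against a running instance (hypothetical field names):
#   session = post_json("/api/session", {})
#   reply = post_json("/api/query",
#                     {"text": "Hello", "session_id": session.get("session_id")})
```

Separating request construction from sending keeps the assumed payload shape easy to inspect and adjust before any network call is made.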
+## Architecture
+The system follows this processing flow:
+```
+[User Voice Input] → [Speech-to-Text] → [Agent Reasoning] → [Text-to-Speech Output] → [Telecom Network Delivery]
+```
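
The flow in the diagram can be illustrated with mock stages chained in the same order. This is only a sketch under stated assumptions: every function name, the `Session` class, and the echo-style behavior are hypothetical placeholders and are not taken from the Space's code.

```python
from dataclasses import dataclass, field


@dataclass
class Session:
    """Per-conversation memory; a hypothetical stand-in for the app's session state."""
    history: list = field(default_factory=list)


def speech_to_text(audio: bytes) -> str:
    # Mock STT: treat the audio bytes as UTF-8 text instead of running a model.
    return audio.decode("utf-8")


def agent_reason(text: str, session: Session) -> str:
    # Mock agent: record the turn and echo it back with its turn number.
    session.history.append(text)
    return f"(turn {len(session.history)}) You said: {text}"


def text_to_speech(text: str) -> bytes:
    # Mock TTS: encode the reply as bytes in place of synthesized audio.
    return text.encode("utf-8")


def deliver(audio: bytes) -> bytes:
    # Mock telecom delivery: a real system would hand this to WebRTC/SIP.
    return audio


def handle_voice_turn(audio: bytes, session: Session) -> bytes:
    """Run one turn through the stages in the order shown in the diagram."""
    text = speech_to_text(audio)
    reply = agent_reason(text, session)
    return deliver(text_to_speech(reply))
```

Because each stage is a plain function with a narrow signature, any one mock can be swapped for a real component without touching the others.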
+## Local Development
+To run this project locally:
+1. Clone the repository
+2. Install dependencies: `pip install -r requirements.txt`
+3. Run the app: `python app.py`
+4. Open http://localhost:8000 in your browser
+## Notes
+- This demo uses simplified mock implementations of each pipeline stage
+- For production use, replace the mock functions with:
+  - Whisper for speech-to-text
+  - An LLM (such as Llama or Mistral) for reasoning
+  - A high-quality TTS engine
+  - A full WebRTC/SIP implementation
+## Future Extensions
+- Full SIP integration
+- Mesh networking with fallback intelligence
+- Enhanced multi-agent collaboration
+- Advanced contextual reasoning