Update README.md
Browse files
README.md
CHANGED
@@ -1,14 +1,26 @@
|
|
1 |
---
|
2 |
-
title:
|
3 |
-
emoji:
|
4 |
colorFrom: green
|
5 |
-
colorTo:
|
6 |
sdk: gradio
|
7 |
-
sdk_version:
|
8 |
app_file: app.py
|
9 |
pinned: false
|
10 |
-
license: apache-2.0
|
11 |
-
short_description: Scalable African LAnguage Multimodal (voice and text) AI Fra
|
12 |
---
|
13 |
|
14 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
+
title: Swahili Voice Assistant
|
3 |
+
emoji: ๐ค๐น๐ฟ
|
4 |
colorFrom: green
|
5 |
+
colorTo: blue
|
6 |
sdk: gradio
|
7 |
+
sdk_version: "4.31.0"
|
8 |
app_file: app.py
|
9 |
pinned: false
|
|
|
|
|
10 |
---
|
11 |
|
12 |
+
# ๐ค Msaidizi wa Sauti wa Kiswahili (Swahili Voice Assistant)
|
13 |
+
|
14 |
+
This is a multi-modal Swahili assistant built with Gradio and Hugging Face Transformers.
|
15 |
+
|
16 |
+
**Features:**
|
17 |
+
- **Speech-to-Speech:** Talk to the assistant and get a spoken response.
|
18 |
+
- **Text-to-Text:** Chat with the assistant using text.
|
19 |
+
- **Tools:** Includes standalone tools for speech transcription and text-to-speech synthesis.
|
20 |
+
|
21 |
+
**Models Used:**
|
22 |
+
- **ASR (STT):** `EYEDOL/SALAMA_C3` (You must replace this placeholder in `app.py`)
|
23 |
+
- **LLM:** `google/gemma-2-9b-it`
|
24 |
+
- **TTS:** A custom ONNX model based on `facebook/mms-tts-swh`.
|
25 |
+
|
26 |
+
**NOTE:** You must upload your `swahili_tts.onnx` file to this repository for the TTS functionality to work.
|