Update README.md
Browse files
README.md
CHANGED
|
@@ -1,14 +1,26 @@
|
|
| 1 |
---
|
| 2 |
-
title:
|
| 3 |
-
emoji:
|
| 4 |
colorFrom: green
|
| 5 |
-
colorTo:
|
| 6 |
sdk: gradio
|
| 7 |
-
sdk_version:
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
| 10 |
-
license: apache-2.0
|
| 11 |
-
short_description: Scalable African LAnguage Multimodal (voice and text) AI Fra
|
| 12 |
---
|
| 13 |
|
| 14 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
+
title: Swahili Voice Assistant
|
| 3 |
+
emoji: ๐ค๐น๐ฟ
|
| 4 |
colorFrom: green
|
| 5 |
+
colorTo: blue
|
| 6 |
sdk: gradio
|
| 7 |
+
sdk_version: "4.31.0"
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
|
|
|
|
|
|
| 10 |
---
|
| 11 |
|
| 12 |
+
# ๐ค Msaidizi wa Sauti wa Kiswahili (Swahili Voice Assistant)
|
| 13 |
+
|
| 14 |
+
This is a multi-modal Swahili assistant built with Gradio and Hugging Face Transformers.
|
| 15 |
+
|
| 16 |
+
**Features:**
|
| 17 |
+
- **Speech-to-Speech:** Talk to the assistant and get a spoken response.
|
| 18 |
+
- **Text-to-Text:** Chat with the assistant using text.
|
| 19 |
+
- **Tools:** Includes standalone tools for speech transcription and text-to-speech synthesis.
|
| 20 |
+
|
| 21 |
+
**Models Used:**
|
| 22 |
+
- **ASR (STT):** `EYEDOL/SALAMA_C3` (You must replace this placeholder in `app.py`)
|
| 23 |
+
- **LLM:** `google/gemma-2-9b-it`
|
| 24 |
+
- **TTS:** A custom ONNX model based on `facebook/mms-tts-swh`.
|
| 25 |
+
|
| 26 |
+
**NOTE:** You must upload your `swahili_tts.onnx` file to this repository for the TTS functionality to work.
|