kingabzpro committed
Commit 980c187 · verified · 1 Parent(s): 12dc36b

Update README.md

Files changed (1)
  1. README.md +23 -3
README.md CHANGED
@@ -1,5 +1,5 @@
  ---
- title: Faster Urdu ASR
  emoji: 🏎️
  colorFrom: red
  colorTo: yellow
@@ -8,7 +8,27 @@ sdk_version: 5.35.0
  app_file: app.py
  pinned: false
  license: apache-2.0
- short_description: Best Urdu speech to text using faster-whisper.
  ---
 
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
  ---
+ title: Urdu STT with GPT-OSS
  emoji: 🏎️
  colorFrom: red
  colorTo: yellow
 
  app_file: app.py
  pinned: false
  license: apache-2.0
+ short_description: High-quality Urdu STT with Faster-Whisper and LLM.
  ---
 
+ # 🏎️ Faster Urdu ASR
+
+ This Space provides **state-of-the-art Urdu Automatic Speech Recognition (ASR)** built on [Faster-Whisper](https://github.com/guillaumekln/faster-whisper), fine-tuned for Urdu.
+ In addition to transcription, it offers **optional polishing with Groq’s `openai/gpt-oss-120b` LLM** to improve Urdu grammar, punctuation, and fluency.
+
+ ## ✨ Features
+ - 🎤 **Audio input** via upload or direct microphone recording
+ - 📜 Multiple output formats: plain text, `.srt`, `.vtt`, `.json`
+ - ⚡ Built on **Faster-Whisper (CT2)** for efficient GPU/CPU inference
+ - 🤖 **Optional LLM polishing** with Groq API for natural, improved Urdu text
+ - 🔑 Works with environment variable `GROQ_API_KEY` or via UI input
+
+ ## 🚀 Usage
+ 1. Upload or record an Urdu audio file.
+ 2. Choose output format (`text`, `srt`, `vtt`, `json`).
+ 3. (Optional) Enable **LLM Polishing** to improve transcription quality.
+    - Provide a valid **`GROQ_API_KEY`** if not set in your environment.
+    - Adjust temperature and system prompt as needed.
+ 4. Click **Transcribe** and view/download your results.
+
+
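The README added in this commit lists `.srt` and `.vtt` among the output formats but does not show how transcript segments become subtitle text. The following is a minimal sketch of one way that conversion could look, assuming each segment is a `(start_seconds, end_seconds, text)` tuple. The names `format_timestamp`, `to_srt`, and `to_vtt` are hypothetical and are not taken from the Space's `app.py`.

```python
from typing import Iterable, Tuple

# Assumed segment shape: (start_seconds, end_seconds, text).
Segment = Tuple[float, float, str]


def format_timestamp(seconds: float, decimal_marker: str = ",") -> str:
    """Render seconds as HH:MM:SS,mmm (SRT) or HH:MM:SS.mmm (VTT)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}{decimal_marker}{ms:03d}"


def to_srt(segments: Iterable[Segment]) -> str:
    """Build SubRip text: numbered cues separated by a blank line."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    return "\n".join(blocks)


def to_vtt(segments: Iterable[Segment]) -> str:
    """Build WebVTT text: a WEBVTT header, then cues with '.' as the marker."""
    lines = ["WEBVTT", ""]
    for start, end, text in segments:
        lines.append(
            f"{format_timestamp(start, '.')} --> {format_timestamp(end, '.')}"
        )
        lines.append(text.strip())
        lines.append("")
    return "\n".join(lines)
```

The two formats differ mainly in the decimal marker (`,` for SRT, `.` for VTT), the `WEBVTT` header, and the cue numbering that SRT requires, so a single timestamp helper can serve both.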