A newer version of the Gradio SDK is available:
5.49.1
metadata
title: Urdu STT with GPT-OSS
emoji: ποΈ
colorFrom: red
colorTo: yellow
sdk: gradio
sdk_version: 5.35.0
app_file: app.py
pinned: false
license: apache-2.0
short_description: High-quality Urdu STT with Faster-Whisper and LLM.
ποΈ Faster Urdu ASR
This Space provides state-of-the-art Urdu Automatic Speech Recognition (ASR) built on Faster-Whisper, fine-tuned for Urdu.
In addition to transcription, it offers optional polishing with Groqβs openai/gpt-oss-120b LLM to improve Urdu grammar, punctuation, and fluency.
β¨ Features
- π€ Audio input via upload or direct microphone recording
- π Multiple output formats: plain text,
.srt,.vtt,.json - β‘ Built on Faster-Whisper (CT2) for efficient GPU/CPU inference
- π€ Optional LLM polishing with Groq API for natural, improved Urdu text
- π Works with environment variable
GROQ_API_KEYor via UI input
π Usage
- Upload or record an Urdu audio file.
- Choose output format (
text,srt,vtt,json). - (Optional) Enable LLM Polishing to improve transcription quality.
- Provide a valid
GROQ_API_KEYif not set in your environment. - Adjust temperature and system prompt as needed.
- Provide a valid
- Click Transcribe and view/download your results.