kingabzpro's picture
Update README.md
980c187 verified

A newer version of the Gradio SDK is available: 5.49.1

Upgrade
metadata
title: Urdu STT with GPT-OSS
emoji: 🏎️
colorFrom: red
colorTo: yellow
sdk: gradio
sdk_version: 5.35.0
app_file: app.py
pinned: false
license: apache-2.0
short_description: High-quality Urdu STT with Faster-Whisper and LLM.

🏎️ Faster Urdu ASR

This Space provides state-of-the-art Urdu Automatic Speech Recognition (ASR) built on Faster-Whisper, fine-tuned for Urdu.
In addition to transcription, it offers optional polishing with Groq’s openai/gpt-oss-120b LLM to improve Urdu grammar, punctuation, and fluency.

✨ Features

  • 🎀 Audio input via upload or direct microphone recording
  • πŸ“œ Multiple output formats: plain text, .srt, .vtt, .json
  • ⚑ Built on Faster-Whisper (CT2) for efficient GPU/CPU inference
  • πŸ€– Optional LLM polishing with Groq API for natural, improved Urdu text
  • πŸ”‘ Works with environment variable GROQ_API_KEY or via UI input

πŸš€ Usage

  1. Upload or record an Urdu audio file.
  2. Choose output format (text, srt, vtt, json).
  3. (Optional) Enable LLM Polishing to improve transcription quality.
    • Provide a valid GROQ_API_KEY if not set in your environment.
    • Adjust temperature and system prompt as needed.
  4. Click Transcribe and view/download your results.