ai_ocr / README.md
utarn's picture
Update README.md
162b367 verified
|
raw
history blame
1.45 kB
metadata
title: Omni API Gradio UI
emoji: 🤖
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 5.46.1
app_file: app.py
pinned: false

Omni API Gradio UI

A Gradio-based user interface for the Omni API that supports text, PDF, image, and audio file processing.

Features

  • Text input for chat messages
  • Multiple file upload support (PDF, images, audio)
  • Configurable API base URL
  • Real-time response display
  • File ordering for multi-modal requests

Installation

# Install dependencies
uv sync

# Run the application
uv run python app.py

Development Mode (with auto-reload)

For development, you can use the auto-reload feature that will automatically restart the app when files change:

uv run python dev.py

This will monitor for changes in Python files, Markdown files, and TOML configuration files, automatically restarting the Gradio app when any of these files are modified.

Usage

  1. Configure the API base URL (defaults to https://api-omni.modelharbor.com)
  2. Enter your text message
  3. Upload files in the desired order (optional)
  4. Click "Send Request" to interact with the API
  5. View the response in the right panel

Supported File Types

  • PDFs: Document processing
  • Images: JPG, PNG, GIF, BMP, WEBP
  • Audio: MP3, WAV, M4A, FLAC, OGG

tags:

  • gradio
  • omni-api
  • multimodal
  • chat-interface
  • pdf-processing
  • image-processing
  • audio-processing
  • llm
  • api-client