Spaces:

Atotti
/

miipher-2-HuBERT-HiFi-GAN-v0.1

Running

App Files Files Community

miipher-2-HuBERT-HiFi-GAN-v0.1 / README.md

Atotti

Update README.md

4979b9d verified 4 months ago

preview code

raw

history blame contribute delete

1.66 kB

A newer version of the Gradio SDK is available: 5.49.1

Upgrade

metadata

title: Miipher 2 HuBERT HiFi GAN V0.1
emoji: ⚡
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 5.38.0
app_file: app.py
pinned: false
license: apache-2.0
python_version: 3.12.10
models:
  - Atotti/miipher-2-HuBERT-HiFi-GAN-v0.1

🎤 Miipher-2 Speech Enhancement Demo

This is a Gradio demo for Miipher-2, a high-quality speech enhancement model that combines HuBERT, Parallel Adapters, and HiFi-GAN vocoder.

Features

Real-time speech enhancement - Remove noise, reverb, and other degradations
Multilingual support - Built on mHuBERT-147 for 147 languages
High-quality output - 22.05kHz audio output
Easy to use - Simple drag-and-drop or microphone input

Model Details

Paper: Miipher-2: High-Quality Speech Enhancement
Model: Atotti/miipher-2-HuBERT-HiFi-GAN-v0.1
GitHub: open-miipher-2

How to Use

Upload an audio file or record using microphone
Click "Enhance Audio" button
Download the enhanced result

Technical Details

The model uses:

SSL Backbone: mHuBERT-147 (multilingual)
Adapter: Parallel adapters inserted at layer 6
Vocoder: HiFi-GAN trained on SSL features
Input: Any sample rate (auto-resampled to 16kHz)
Output: 22.05kHz enhanced audio

Citation

@article{miipher2024,
  title={Miipher-2: High-Quality Speech Enhancement via Self-Supervised Learning},
  author={Your Name and Others},
  journal={arXiv preprint arXiv:2505.04457},
  year={2024}
}