metadata
title: Solr Normalization Demo
emoji: 🔥
colorFrom: blue
colorTo: indigo
sdk: docker
pinned: false
short_description: Solr text normalization pipeline demo for Impresso
Solr Normalization Demo
This demo showcases the text normalization pipeline that replicates Apache Solr's functionality, as used in the Impresso project.
Solr normalization is intended to give an idea of what kind of normalization is happening behind Impresso.
Features
- Multi-language support (German, French, Spanish, Italian, Portuguese, Dutch, English)
- Auto-language detection
- Tokenization and stopword removal
- Analyzer pipeline visualization
- Pre-loaded examples for quick testing
Try the live demo to see how different texts are processed through the normalization pipeline!
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference