maslionok committed
Commit · 3831088
1 Parent(s): fa210be

small fix

- Dockerfile +5 -1
- README.md +0 -36

Dockerfile CHANGED
@@ -1,4 +1,8 @@
-FROM python:3.
+FROM python:3.11-slim
+
+WORKDIR /code
+
+RUN pip install --no-cache-dir --upgrade gradio==4.44.0 impresso-pipelines[ocrqa]==0.4.3.24python:3.9
 
 WORKDIR /code
 
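
Read literally, the added Dockerfile lines sit on top of the existing `WORKDIR /code`, so the directive now appears twice, and the version pin on the `RUN` line ends in a fused `python:3.9`. A minimal sketch of what the head of the file might look like once tidied, assuming the fused suffix is residue rather than part of the pin, and that the pin is meant to end at `0.4.3.24` (neither is confirmed by the commit):

```dockerfile
# Sketch only: assumes the trailing "python:3.9" in the diff is residue and
# that the intended pin is 0.4.3.24 (an assumption, not confirmed by the commit).
FROM python:3.11-slim

# Declared once; the diff as shown leaves a second WORKDIR /code further down.
WORKDIR /code

# Quote the requirement so the shell cannot treat [ocrqa] as a glob pattern.
RUN pip install --no-cache-dir --upgrade \
    gradio==4.44.0 \
    "impresso-pipelines[ocrqa]==0.4.3.24"
```

Installing both packages in one `RUN` keeps them in a single image layer, and `--no-cache-dir` avoids storing pip's download cache in that layer.
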
README.md CHANGED
@@ -8,39 +8,3 @@ pinned: false
 short_description: OCR Quality Assessment demo for Impresso project
 ---
 
-
-
-# OCR Quality Assessment Pipeline Demo
-
-This demo showcases the **OCR Quality Assessment Pipeline** from the Impresso project, which analyzes and improves text extracted from OCR (Optical Character Recognition).
-
-## Features
-
-- **OCR Error Detection**: Identifies common OCR mistakes and artifacts
-- **Quality Assessment**: Evaluates the overall quality of OCR text
-- **Text Correction**: Suggests improvements for detected errors
-- **Interactive Interface**: User-friendly Gradio web interface
-
-## Usage
-
-The demo accepts OCR text input and provides:
-- Quality assessment scores
-- Detected OCR errors
-- Suggested corrections
-- Processed/improved text
-
-## Example
-
-Try the provided German text example that contains typical OCR errors like:
-- Character misrecognition (e.g., "Zaubrisch" instead of "Zauberisch")
-- Spacing issues (e.g., "nacb" instead of "nach")
-- Punctuation errors (e.g., "d:m" instead of "dem")
-
-## Installation
-
-```bash
-pip install -r requirements.txt
-python app.py
-```
-
-The demo will be available at `http://localhost:7860`