ImageDataExtractor3

Runtime error

App Files Files Community

WebashalarForML commited on Oct 4, 2024

Commit

d65fd5b

verified ·

1 Parent(s): 3dae3d4

Update README2.md

Browse files

Files changed (1) hide show

README2.md +7 -9

README2.md CHANGED Viewed

@@ -3,10 +3,9 @@ _\\-------- **Image Data Extractor** -------\\_
 _\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\_
 ---
 # Overview:
 The **Image Data Extractor** is a Python-based tool designed to extract and structure text data from images of visiting cards using **PaddleOCR**. The tool processes the extracted text to recognize key information such as name, designation, contact number, address, and company name, organizing the output into a well-defined structure. The **Mistral 7B model** is used for advanced text analysis, and if it becomes unavailable, the system seamlessly switches to the **Gliner urchade/gliner_mediumv2.1** model.
 # Installation Guide:
 1. **Create and Activate a Virtual Environment**
@@ -37,7 +36,7 @@ The **Image Data Extractor** is a Python-based tool designed to extract and stru
     ```bash
     HF_TOKEN=<your_huggingface_token>
     ```
 # File Structure Overview:
 ```
@@ -71,7 +70,7 @@ ImageDataExtractor/
 │
 └── .env                         # Environment variables (includes Hugging Face token)
 ```
 # Program Overview:
 ### PaddleOCR Integration (utility/utils.py):
@@ -88,7 +87,7 @@ ImageDataExtractor/
 ### Web Interface (app.py):
 - **Flask API**: Provides endpoints for image uploads and displays the results in a structured manner.
 - **HTML Interface**: A frontend for users to upload images of visiting cards and view the parsed results.
 # Tree Map of the Program:
 ```
@@ -108,11 +107,11 @@ Backup/backup.py
 └── Backup and error handling
 ```
 # Licensing:
 - **Mistral 7B model** is used under the [Apache 2.0 license](https://www.apache.org/licenses/LICENSE-2.0).
 - **Gliner urchade/gliner_mediumv2.1 model** is used under the [Apache 2.0 license](https://www.apache.org/licenses/LICENSE-2.0).
 # Main Task:
 The main objective is to extract and structure text data from visiting cards. The system identifies and organizes:
 - **Name**
@@ -120,7 +119,7 @@ The main objective is to extract and structure text data from visiting cards. Th
 - **Phone Number**
 - **Address**
 - **Company Name**
 # References:
 - [PaddleOCR Documentation](https://github.com/PaddlePaddle/PaddleOCR)
@@ -129,5 +128,4 @@ The main objective is to extract and structure text data from visiting cards. Th
 - [Flask Documentation](https://flask.palletsprojects.com/)
 - [Docker Documentation](https://docs.docker.com/)
 - [Virtual Environments in Python](https://docs.python.org/3/tutorial/venv.html)
 ---

 _\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\_
 ---
 # Overview:
 The **Image Data Extractor** is a Python-based tool designed to extract and structure text data from images of visiting cards using **PaddleOCR**. The tool processes the extracted text to recognize key information such as name, designation, contact number, address, and company name, organizing the output into a well-defined structure. The **Mistral 7B model** is used for advanced text analysis, and if it becomes unavailable, the system seamlessly switches to the **Gliner urchade/gliner_mediumv2.1** model.
+---
 # Installation Guide:
 1. **Create and Activate a Virtual Environment**
     ```bash
     HF_TOKEN=<your_huggingface_token>
     ```
+---
 # File Structure Overview:
 ```
 │
 └── .env                         # Environment variables (includes Hugging Face token)
 ```
+---
 # Program Overview:
 ### PaddleOCR Integration (utility/utils.py):
 ### Web Interface (app.py):
 - **Flask API**: Provides endpoints for image uploads and displays the results in a structured manner.
 - **HTML Interface**: A frontend for users to upload images of visiting cards and view the parsed results.
+---
 # Tree Map of the Program:
 ```
 └── Backup and error handling
 ```
+---
 # Licensing:
 - **Mistral 7B model** is used under the [Apache 2.0 license](https://www.apache.org/licenses/LICENSE-2.0).
 - **Gliner urchade/gliner_mediumv2.1 model** is used under the [Apache 2.0 license](https://www.apache.org/licenses/LICENSE-2.0).
+---
 # Main Task:
 The main objective is to extract and structure text data from visiting cards. The system identifies and organizes:
 - **Name**
 - **Phone Number**
 - **Address**
 - **Company Name**
+---
 # References:
 - [PaddleOCR Documentation](https://github.com/PaddlePaddle/PaddleOCR)
 - [Flask Documentation](https://flask.palletsprojects.com/)
 - [Docker Documentation](https://docs.docker.com/)
 - [Virtual Environments in Python](https://docs.python.org/3/tutorial/venv.html)
 ---