Spaces:
Runtime error
Runtime error
Commit
·
24e7c59
1
Parent(s):
7df75ff
update: docuementation with installation instructions
Browse files- README.md +38 -1
- docs/index.md +38 -1
- medrag_multi_modal/document_loader/load_image.py +2 -2
- medrag_multi_modal/retrieval/__init__.py +3 -0
README.md
CHANGED
|
@@ -1,3 +1,40 @@
|
|
| 1 |
# MedRAG Multi-Modal
|
| 2 |
|
| 3 |
-
Multi-modal RAG for medical docmain.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
# MedRAG Multi-Modal
|
| 2 |
|
| 3 |
+
Multi-modal RAG for medical docmain.
|
| 4 |
+
|
| 5 |
+
## Installation
|
| 6 |
+
|
| 7 |
+
### For Development
|
| 8 |
+
|
| 9 |
+
For MacOS, you need to run
|
| 10 |
+
|
| 11 |
+
```bash
|
| 12 |
+
brew install poppler
|
| 13 |
+
```
|
| 14 |
+
|
| 15 |
+
For Debian/Ubuntu, you need to run
|
| 16 |
+
|
| 17 |
+
```bash
|
| 18 |
+
sudo apt-get install -y poppler-utils
|
| 19 |
+
```
|
| 20 |
+
|
| 21 |
+
Then, you can install the dependencies using uv in the virtual environment `.venv` using
|
| 22 |
+
|
| 23 |
+
```bash
|
| 24 |
+
git clone https://github.com/soumik12345/medrag-multi-modal
|
| 25 |
+
cd medrag-multi-modal
|
| 26 |
+
pip install -U pip uv
|
| 27 |
+
uv sync
|
| 28 |
+
```
|
| 29 |
+
|
| 30 |
+
After this, you need to activate the virtual environment using
|
| 31 |
+
|
| 32 |
+
```bash
|
| 33 |
+
source .venv/bin/activate
|
| 34 |
+
```
|
| 35 |
+
|
| 36 |
+
In the activated virtual environment, you need to install Flash Attention using
|
| 37 |
+
|
| 38 |
+
```bash
|
| 39 |
+
uv pip install flash-attn --no-build-isolation
|
| 40 |
+
```
|
docs/index.md
CHANGED
|
@@ -1,3 +1,40 @@
|
|
| 1 |
# MedRAG Multi-Modal
|
| 2 |
|
| 3 |
-
Multi-modal RAG for medical docmain.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
# MedRAG Multi-Modal
|
| 2 |
|
| 3 |
+
Multi-modal RAG for medical docmain.
|
| 4 |
+
|
| 5 |
+
## Installation
|
| 6 |
+
|
| 7 |
+
### For Development
|
| 8 |
+
|
| 9 |
+
For MacOS, you need to run
|
| 10 |
+
|
| 11 |
+
```bash
|
| 12 |
+
brew install poppler
|
| 13 |
+
```
|
| 14 |
+
|
| 15 |
+
For Debian/Ubuntu, you need to run
|
| 16 |
+
|
| 17 |
+
```bash
|
| 18 |
+
sudo apt-get install -y poppler-utils
|
| 19 |
+
```
|
| 20 |
+
|
| 21 |
+
Then, you can install the dependencies using uv in the virtual environment `.venv` using
|
| 22 |
+
|
| 23 |
+
```bash
|
| 24 |
+
git clone https://github.com/soumik12345/medrag-multi-modal
|
| 25 |
+
cd medrag-multi-modal
|
| 26 |
+
pip install -U pip uv
|
| 27 |
+
uv sync
|
| 28 |
+
```
|
| 29 |
+
|
| 30 |
+
After this, you need to activate the virtual environment using
|
| 31 |
+
|
| 32 |
+
```bash
|
| 33 |
+
source .venv/bin/activate
|
| 34 |
+
```
|
| 35 |
+
|
| 36 |
+
In the activated virtual environment, you need to install Flash Attention using
|
| 37 |
+
|
| 38 |
+
```bash
|
| 39 |
+
uv pip install flash-attn --no-build-isolation
|
| 40 |
+
```
|
medrag_multi_modal/document_loader/load_image.py
CHANGED
|
@@ -13,8 +13,8 @@ from medrag_multi_modal.document_loader.load_text import TextLoader
|
|
| 13 |
|
| 14 |
class ImageLoader(TextLoader):
|
| 15 |
"""
|
| 16 |
-
ImageLoader is a class that extends the `TextLoader` class to handle the extraction and
|
| 17 |
-
loading of
|
| 18 |
|
| 19 |
This class provides functionality to convert specific pages of a PDF document into images
|
| 20 |
and optionally publish these images to a Weave dataset.
|
|
|
|
| 13 |
|
| 14 |
class ImageLoader(TextLoader):
|
| 15 |
"""
|
| 16 |
+
`ImageLoader` is a class that extends the `TextLoader` class to handle the extraction and
|
| 17 |
+
loading of pages from a PDF file as images.
|
| 18 |
|
| 19 |
This class provides functionality to convert specific pages of a PDF document into images
|
| 20 |
and optionally publish these images to a Weave dataset.
|
medrag_multi_modal/retrieval/__init__.py
CHANGED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
from .multi_modal_retrieval import MultiModalRetriever
|
| 2 |
+
|
| 3 |
+
__all__ = ["MultiModalRetriever"]
|