Update README.md
Browse files
README.md
CHANGED
|
@@ -41,16 +41,17 @@ chandra_app
|
|
| 41 |
|
| 42 |
## Benchmarks
|
| 43 |
|
| 44 |
-
| **Model** | ArXiv | Old Scans Math | Tables | Old Scans | Headers and Footers | Multi column | Long tiny text |
|
| 45 |
-
|
| 46 |
-
| Datalab Chandra v0.1.0 |
|
| 47 |
-
| Datalab Marker v1.10.0 | **83.8** |
|
| 48 |
-
| Mistral OCR API | 77.2 |
|
| 49 |
-
| Deepseek OCR | 75.2 |
|
| 50 |
-
| GPT-4o (Anchored) | 53.5 |
|
| 51 |
-
| Gemini Flash 2 (Anchored) | 54.5 |
|
| 52 |
-
| Qwen 3 VL | 70.2 |
|
| 53 |
-
| olmOCR v0.3.0 | 78.6 |
|
|
|
|
| 54 |
|
| 55 |
## Examples
|
| 56 |
|
|
|
|
| 41 |
|
| 42 |
## Benchmarks
|
| 43 |
|
| 44 |
+
| **Model** | ArXiv | Old Scans Math | Tables | Old Scans | Headers and Footers | Multi column | Long tiny text | Base | Overall | Source |
|
| 45 |
+
|:----------|:--------:|:--------------:|:--------:|:---------:|:-------------------:|:------------:|:--------------:|:----:|:--------------:|:------:|
|
| 46 |
+
| Datalab Chandra v0.1.0 | 82.2 | **80.3** | **88.0** | **50.4** | 90.8 | 81.2 | **92.3** | **99.9** | **83.1 ± 0.9** | Own benchmarks |
|
| 47 |
+
| Datalab Marker v1.10.0 | **83.8** | 69.7 | 74.8 | 32.3 | 86.6 | 79.4 | 85.7 | 99.6 | 76.5 ± 1.0 | Own benchmarks |
|
| 48 |
+
| Mistral OCR API | 77.2 | 67.5 | 60.6 | 29.3 | 93.6 | 71.3 | 77.1 | 99.4 | 72.0 ± 1.1 | olmocr repo |
|
| 49 |
+
| Deepseek OCR | 75.2 | 72.3 | 79.7 | 33.3 | 96.1 | 66.7 | 80.1 | 99.7 | 75.4 ± 1.0 | Own benchmarks |
|
| 50 |
+
| GPT-4o (Anchored) | 53.5 | 74.5 | 70.0 | 40.7 | 93.8 | 69.3 | 60.6 | 96.8 | 69.9 ± 1.1 | olmocr repo |
|
| 51 |
+
| Gemini Flash 2 (Anchored) | 54.5 | 56.1 | 72.1 | 34.2 | 64.7 | 61.5 | 71.5 | 95.6 | 63.8 ± 1.2 | olmocr repo |
|
| 52 |
+
| Qwen 3 VL | 70.2 | 75.1 | 45.6 | 37.5 | 89.1 | 62.1 | 43.0 | 94.3 | 64.6 ± 1.1 | Own benchmarks |
|
| 53 |
+
| olmOCR v0.3.0 | 78.6 | 79.9 | 72.9 | 43.9 | **95.1** | 77.3 | 81.2 | 98.9 | 78.5 ± 1.1 | olmocr repo |
|
| 54 |
+
| dots.ocr | 82.1 | 64.2 | 88.3 | 40.9 | 94.1 | **82.4** | 81.2 | 99.5 | 79.1 ± 1.0 | dots.ocr repo |
|
| 55 |
|
| 56 |
## Examples
|
| 57 |
|