</tr>
</table>

The benchmark results on [MTVQA](https://github.com/bytedance/MTVQA/tree/main):
| Models                    | Open-Source | Vietnamese Score |
|:-------------------------:|:-----------:|:----------------:|
| Qwen2-VL 72B (Top1)       | ✘           | 41.6             |
| GPT-4o (Top2)             | ✘           | 34.2             |
| **Vintern-1B-V2** (Top3)  | ✔           | **31.7**         |
| Qwen2-VL 7B               | ✔           | 30.0             |
| Claude3 Opus              | ✘           | 29.1             |
| GPT-4o mini               | ✘           | 29.1             |
| GPT-4V                    | ✘           | 28.9             |
| Vintern-1B-V3             | ✔           | 28.7             |
| Gemini Ultra              | ✘           | 28.6             |
| InternVL2 76B             | ✔           | 26.9             |
| QwenVL Max                | ✘           | 23.5             |
| Claude3 Sonnet            | ✘           | 20.8             |
| QwenVL Plus               | ✘           | 18.1             |
| MiniCPM-V2.5              | ✔           | 15.3             |
| InternVL-V1.5             | ✔           | 12.4             |
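If you want to probe the model on an MTVQA-style Vietnamese question yourself, below is a minimal sketch. It assumes the `5CD-AI/Vintern-1B-v2` repo id, an InternVL2-style `model.chat` remote-code interface, single-tile 448×448 preprocessing, and a local `receipt.jpg`; it is not the official MTVQA evaluation harness, so defer to the usage code on the model card.

```python
# Minimal, hypothetical single-query sketch (not the official MTVQA harness).
# Assumed: the 5CD-AI/Vintern-1B-v2 repo id, InternVL2-style `model.chat`
# remote code, single-tile 448x448 preprocessing, a GPU with bfloat16 support.
import torch
import torchvision.transforms as T
from PIL import Image
from transformers import AutoModel, AutoTokenizer

MODEL_ID = "5CD-AI/Vintern-1B-v2"  # assumption: adjust to the actual repo id

model = AutoModel.from_pretrained(
    MODEL_ID, torch_dtype=torch.bfloat16, trust_remote_code=True
).eval().cuda()
tokenizer = AutoTokenizer.from_pretrained(MODEL_ID, trust_remote_code=True)

# Single-tile InternVL-style preprocessing (448x448, ImageNet mean/std);
# the full pipeline also does dynamic tiling, omitted here for brevity.
transform = T.Compose([
    T.Resize((448, 448)),
    T.ToTensor(),
    T.Normalize(mean=(0.485, 0.456, 0.406), std=(0.229, 0.224, 0.225)),
])
pixel_values = transform(Image.open("receipt.jpg").convert("RGB"))
pixel_values = pixel_values.unsqueeze(0).to(torch.bfloat16).cuda()

# A Vietnamese text-VQA question, as in MTVQA:
# "What is the phone number on the receipt?"
question = "<image>\nSố điện thoại trên hóa đơn là gì?"
response = model.chat(
    tokenizer, pixel_values, question,
    generation_config=dict(max_new_tokens=64, do_sample=False),
)
print(response)
```

Greedy decoding (`do_sample=False`) keeps the answer deterministic, which is what string-match VQA scoring expects.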
We are still working on more detailed benchmarks.

## Examples