Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
PaddlePaddle
company
Verified
AI & ML interests
Deep Learning Framework
Recent Activity
View all activity
Papers
GraphNet: A Large-Scale Computational Graph Dataset for Tensor Compiler Research
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese
Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
PP-StructureV3 is a SOTA document parsing solution on OmniDocBench, supporting the conversion of PDFs and do cument images to Markdown and JSON.
PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese