PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper • 2510.14528 • Published 18 days ago • 78
Article Qianfan-VL: A Milestone Achievement in Chinese Multimodal AI with Domestic Chips By baidu • Sep 24 • 8
Qianfan-VL Collection The Qianfan-VL model series. The models are mainly domain-enhanced vision-language models targeting enterprise-level multimodal understanding scenarios. • 4 items • Updated Sep 24 • 19
Article Unleashing the Full Potential of ERNIE4.5 using FastDeploy By baidu and 3 others • Sep 19 • 11
Article PP-OCRv5 on Hugging Face: A Specialized Approach to OCR By baidu and 5 others • Sep 10 • 108
PP-OCRv5 Collection PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese. • 13 items • Updated Sep 15 • 48
ERNIE 4.5 Collection A collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 26 items • Updated Sep 24 • 174