Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
AI & ML interests
Deep Learning Framework
Papers
GraphNet: A Large-Scale Computational Graph Dataset for Tensor Compiler Research
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese
PP-OCRv5 Online Demo
Universal-Scene Text Recognition Model with High Accuracy
PaddlePaddle/PP-OCRv5_mobile_det
Image-to-Text • Updated • 47.3k • 18
PaddlePaddle/PP-OCRv5_mobile_rec
Image-to-Text • Updated • 8.04k • 8
PaddlePaddle/PP-OCRv5_server_det
Image-to-Text • Updated • 315k • 50

PaddlePaddle/arabic_PP-OCRv3_mobile_rec
Image-to-Text • Updated • 556 • 1
PaddlePaddle/chinese_cht_PP-OCRv3_mobile_rec
Image-to-Text • Updated • 35
PaddlePaddle/cyrillic_PP-OCRv3_mobile_rec
Image-to-Text • Updated • 216
PaddlePaddle/devanagari_PP-OCRv3_mobile_rec
Image-to-Text • Updated • 210
PP-StructureV3 is a SOTA document parsing solution on OmniDocBench, supporting the conversion of PDFs and document images to Markdown and JSON.