OCR on Layout Detection

by okayatul - opened 17 days ago

17 days ago

I have successfully done the layout detection, I just want to do the OCR on those boundary boxes and get the data but I am having trouble doing that please help.

ChengCui

PaddlePaddle org 16 days ago

In PP-StructureV3, the layout detection of PP-DocLayout_plus-L is only the first step. It is followed by the OCR module of your choice. You can refer to Introduction to PP-StructureV3 and PP-StructureV3 Pipeline Usage Tutorial for more information.

AtulOk

10 days ago

Okay, understood. I'm using the ppstructurev3 pipeline for parsing resumes, but when I give table-type resumes, it returns the markdown in HTML format. I expected it to return the markdown in a text file. How do I fix this? Because parsing is complicated here, the structure of the PDF is lost. I have also used tablerecognitionpipeline v2 and ppstructurev3 pipeline too.

AtulOk

9 days ago

In PP-StructureV3, the layout detection of PP-DocLayout_plus-L is only the first step. It is followed by the OCR module of your choice. You can refer to Introduction to PP-StructureV3 and PP-StructureV3 Pipeline Usage Tutorial for more information.

Please reply

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment