OCR on Layout Detection

#1
by okayatul - opened

I have successfully done the layout detection, I just want to do the OCR on those boundary boxes and get the data but I am having trouble doing that please help.

PaddlePaddle org

In PP-StructureV3, the layout detection of PP-DocLayout_plus-L is only the first step. It is followed by the OCR module of your choice. You can refer to Introduction to PP-StructureV3 and PP-StructureV3 Pipeline Usage Tutorial for more information.

Okay, understood. I'm using the ppstructurev3 pipeline for parsing resumes, but when I give table-type resumes, it returns the markdown in HTML format. I expected it to return the markdown in a text file. How do I fix this? Because parsing is complicated here, the structure of the PDF is lost. I have also used tablerecognitionpipeline v2 and ppstructurev3 pipeline too.

In PP-StructureV3, the layout detection of PP-DocLayout_plus-L is only the first step. It is followed by the OCR module of your choice. You can refer to Introduction to PP-StructureV3 and PP-StructureV3 Pipeline Usage Tutorial for more information.

Please reply

Sign up or log in to comment