Document orientations #389

m-salewski · 2024-11-20T07:46:39Z

m-salewski
Nov 20, 2024

Hello, i was investigating Docling to scrape texts from various delivery-related documents. One document is a scan of product labels with different orientations on a A4 in portrait orientation.

Attached is a sample PDF with the labels like this:

!

Here are the bounding boxes for the page segments (red solid lines) and page cells (blue dotted lines): it shows the orientations compromise the OCR
!

I tried different rotations with the document: 270 degree helped to get parts of the rotated label but not what i expected (like the company name and address in the sample); 90 and 180 found some page segments but failed to detect any words. In the actual document, there are detectable words at all 4 orientations.

One of the easiest hacks was to use EasyOCR's rotation_info parameter in the reader. This helped in some parts but failed for others as this only really affects the page cells which are dependent on the page segments. Why doesn't this work? Is the OCR's orientation fixed for all page segments?

label_orientations.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Document orientations #389

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 0 comments

Select a reply

Document orientations #389

m-salewski Nov 20, 2024

Replies: 0 comments

m-salewski
Nov 20, 2024