extraction of certain information impossible #584
-
Hi @Larbo53, I appreciate your interest in the library. I am not able to extract any text from the top-left portion you shared. If you try selecting the text there in a PDF viewer, you'll see that it can't be selected, which is why pdfplumber can't extract it either. As an alternative, you can run the PDF through OCR software and then extract the required text.
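A minimal sketch of the OCR route, assuming the third-party `pytesseract` package and the Tesseract binary are installed (neither is part of pdfplumber). The function names `ocr_region` and `clean_ocr_text`, the file name, and the bounding box are hypothetical and only for illustration; the box would need to be adjusted to the actual position of the supplier block.

```python
def clean_ocr_text(raw):
    """Strip surrounding whitespace and drop blank lines from raw OCR output."""
    lines = (line.strip() for line in raw.splitlines())
    return "\n".join(line for line in lines if line)


def ocr_region(pdf_path, bbox, page_number=0, lang="eng", resolution=300):
    """Render one region of a page to an image and run OCR on it.

    bbox is (x0, top, x1, bottom) in PDF points, as used by pdfplumber.
    """
    # Imported lazily: both are third-party packages.
    import pdfplumber
    import pytesseract

    with pdfplumber.open(pdf_path) as pdf:
        region = pdf.pages[page_number].crop(bbox)
        # .original is the rendered PIL image of the cropped region.
        image = region.to_image(resolution=resolution).original
        raw = pytesseract.image_to_string(image, lang=lang)
    return clean_ocr_text(raw)


if __name__ == "__main__":
    # Hypothetical file name and coordinates; replace with real values.
    print(ocr_region("invoice.pdf", bbox=(0, 0, 300, 150), lang="fra"))
```

Passing `lang="fra"` assumes the French language data for Tesseract is installed; otherwise the default `"eng"` may still work but is likely to misread accented characters.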
-
In this case, the block of supplier information is not drawn with text-drawing instructions. Instead, it is drawn directly as filled paths made of curves and lines. Regular text extractors, which look for text-drawing instructions, won't recognize any text in that block.
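One way to check this for a given region is to compare how many character objects and how many path objects pdfplumber reports there. The following is a rough sketch, not a definitive test: `classify_region` and `inspect_region` are hypothetical helper names, and the file name and bounding box are placeholders.

```python
def classify_region(n_chars, n_paths):
    """Guess how a region was drawn from its object counts."""
    if n_chars > 0:
        return "text"             # real text objects: extract_text() should work
    if n_paths > 0:
        return "vector-outlines"  # glyphs drawn as paths: OCR is needed
    return "empty-or-image"       # nothing vector-based: possibly a raster image


def inspect_region(pdf_path, bbox, page_number=0):
    """Count chars and path objects inside bbox (x0, top, x1, bottom)."""
    import pdfplumber  # third-party; imported lazily

    with pdfplumber.open(pdf_path) as pdf:
        region = pdf.pages[page_number].crop(bbox)
        n_chars = len(region.chars)
        # Curves, lines and rects are the path objects pdfplumber exposes.
        n_paths = len(region.curves) + len(region.lines) + len(region.rects)
    return classify_region(n_chars, n_paths)


if __name__ == "__main__":
    # Hypothetical file name and coordinates; replace with real values.
    print(inspect_region("invoice.pdf", bbox=(0, 0, 300, 150)))
```

A result of `"vector-outlines"` for the supplier block would match the explanation above: there is visible lettering, but no text objects for an extractor to read.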
-
Hello,
I would like to extract the supplier's information (top-left block: Bourgeois Frères; see attached image) from the attached PDF.
What is the solution?
OS: macOS Big Sur 11.6
Python: 3.9.7
pdfplumber: 0.6.0
Thank you.
Sincerely
BOURGEOISFacture 21053886.pdf