Extracting Data into Columns #778
chanpreet90 started this conversation in Ask for help with specific PDFs
Replies: 2 comments 13 replies
-
Hi @chanpreet90, unfortunately it's difficult to help without access to the PDF. Could you attach it? I would also recommend trying the "text" strategies listed in the table-extraction settings documentation.
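A minimal sketch of what trying the "text" strategies could look like, assuming you supply the file path and page index yourself. `extract_with_text_strategy` is a hypothetical helper name, not part of pdfplumber; the settings keys and `extract_table` are pdfplumber's own.

```python
# Table settings for pages without ruling lines: pdfplumber infers cell
# edges from how the words line up instead of from drawn lines.
TEXT_SETTINGS = {
    "vertical_strategy": "text",
    "horizontal_strategy": "text",
}


def extract_with_text_strategy(pdf_path, page_number=0):
    """Return the first table found on the page as a list of rows.

    Hypothetical helper for illustration; pdf_path and page_number
    are placeholders for your own file and page.
    """
    # Imported here so the settings above can be inspected without pdfplumber installed.
    import pdfplumber

    with pdfplumber.open(pdf_path) as pdf:
        page = pdf.pages[page_number]
        return page.extract_table(table_settings=TEXT_SETTINGS)
```

If the result still comes back as one column, the word alignment on the page is probably too irregular for the "text" strategy to pick up on its own.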
-
@chanpreet90 If the "text" strategy doesn't help and there are no vertical separators, you can set the column boundaries yourself with the "explicit_vertical_lines" setting. In your case, the table settings could look like this:
{
    "vertical_strategy": "lines",
    "horizontal_strategy": "lines",
    "explicit_vertical_lines": [80, 360, 450, 560, 680]
}
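The numbers in "explicit_vertical_lines" are x-coordinates on the page, so they have to be read off the actual PDF. One rough way to find candidates, sketched below, is to look for horizontal gaps that no word crosses. This gap heuristic is my own illustration, not a pdfplumber feature; `column_gaps`, `suggest_vertical_lines`, and the `min_gap` default are hypothetical names and values. Only `pdfplumber.open` and `page.extract_words()` (which returns dicts with "x0" and "x1" keys) come from the library.

```python
def column_gaps(words, min_gap=10):
    """Return x-positions at the midpoint of each horizontal gap
    of at least min_gap points that no word overlaps."""
    spans = sorted((w["x0"], w["x1"]) for w in words)
    if not spans:
        return []
    separators = []
    right = spans[0][1]  # right-most edge covered so far
    for x0, x1 in spans[1:]:
        if x0 - right >= min_gap:
            separators.append((right + x0) / 2)
        right = max(right, x1)
    return separators


def suggest_vertical_lines(pdf_path, page_number=0, min_gap=10):
    """Suggest values for explicit_vertical_lines: the outer edges of
    the text plus one line per detected gap (hypothetical helper)."""
    import pdfplumber  # imported lazily; column_gaps itself needs no library

    with pdfplumber.open(pdf_path) as pdf:
        words = pdf.pages[page_number].extract_words()
    if not words:
        return []
    left = min(w["x0"] for w in words)
    right = max(w["x1"] for w in words)
    return [left, *column_gaps(words, min_gap), right]
```

The returned list can be passed as the "explicit_vertical_lines" value in the settings above. Cells that contain several words separated by wide spaces can produce spurious gaps, so the suggestions usually need a manual check against the page.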
-
I have a PDF with data in tabular format in 6 columns, but the columns are not separated by boundaries. When I extract the data using pdfplumber, everything comes out in a single column, and I want it in separate columns.
How could I do that?