Unable to extract full table from PDF #150

Open
keerthip1121 opened this issue Dec 2, 2021 · 0 comments

keerthip1121 commented Dec 2, 2021

I was trying to extract a table from a PDF file and convert it to Excel, but the full table is not extracted when using the 'stream' flavor. The table was split into two DataFrames (which I concatenated afterwards, so that part is not a problem), but some of the table data is missing. With the 'lattice' flavor the full table data is extracted, but I prefer the format that 'stream' produces. Can you please help me extract the full table data with 'stream' itself?
In the attached Excel file, Sheet1 contains the data extracted with flavor 'stream' and Sheet2 the data extracted with 'lattice'.
pdf
pdf-excel226-11.xlsx
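For anyone trying to reproduce this, the workflow described above can be sketched roughly as follows. This is only a minimal sketch under assumptions: the post does not name the library, but `flavor='stream'` and `flavor='lattice'` match the `read_pdf` function of camelot-py, so that is assumed here. The function names, the `pages` default, and the file paths are hypothetical and not taken from the original report.

```python
import pandas as pd


def combine_tables(frames):
    """Stack per-table DataFrames into one frame, renumbering the rows."""
    frames = list(frames)
    if not frames:
        # pd.concat raises on an empty list, so return an empty frame instead.
        return pd.DataFrame()
    return pd.concat(frames, ignore_index=True)


def extract_both_flavors(pdf_path, pages="1"):
    """Read the same pages with both flavors and return (stream_df, lattice_df)."""
    # Assumption: camelot-py is installed; imported lazily so the helper
    # above can be used without it.
    import camelot

    stream_tables = camelot.read_pdf(pdf_path, pages=pages, flavor="stream")
    lattice_tables = camelot.read_pdf(pdf_path, pages=pages, flavor="lattice")

    # Each detected table exposes its contents as a DataFrame via `.df`.
    stream_df = combine_tables(t.df for t in stream_tables)
    lattice_df = combine_tables(t.df for t in lattice_tables)
    return stream_df, lattice_df


def write_comparison(stream_df, lattice_df, xlsx_path):
    """Write the 'stream' result to Sheet1 and the 'lattice' result to Sheet2."""
    # Writing .xlsx files requires an Excel engine such as openpyxl.
    with pd.ExcelWriter(xlsx_path) as writer:
        stream_df.to_excel(writer, sheet_name="Sheet1", index=False, header=False)
        lattice_df.to_excel(writer, sheet_name="Sheet2", index=False, header=False)
```

With a hypothetical input file, calling `extract_both_flavors("input.pdf")` and passing both results to `write_comparison(..., "comparison.xlsx")` would produce a workbook laid out like the attached one, which makes it easier to see which rows 'stream' drops.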
