Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Any interest in new feature: Getting URLs from pdfinfo? #257

Draft
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

jpreiss
Copy link

@jpreiss jpreiss commented Feb 17, 2023

I am using your library to rasterize PDFs in my presentation viewer https://github.com/jpreiss/pypdfdeck (branch videos).

I want to add a feature where any embedded URL that starts with file:// is interpreted to mean "instead of the PDF contents, display the video from this local path when viewing this page".

The command line pdfinfo can extract the URLs, but it is not exposed through the current python interface.

To do this, it would be nice if I can lean on pdf2image to properly find the poppler binaries, etc. Therefore, I added the option to extract URLs in pdfinfo_from_path().

This is not ready to merge - it needs design review, tests, equivalent _from_bytes() version, better docs, etc. Just wanted to check if this feature is actually desired before I finish the work.

Thanks!

fix type annotations
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant