PDF support #281

Open
oliver021 opened this issue Mar 17, 2021 · 5 comments

Comments

@oliver021

Hello friends, I was wondering whether it would be a good idea to include metadata extractors for other file types, such as PDFs, Excel spreadsheets, and Word documents. These files also contain a lot of metadata, and I have not seen any metadata extraction library that covers them. It would be a valuable addition, since the name of this library does not limit it to the metadata of multimedia files.

@drewnoakes
Owner

The library is open to the addition of support for other kinds of data, with the following guidelines:

  • No dependencies on external libraries (we have only one exception to this for XMP processing)
  • Metadata must be representable using the directory/tag structure we use throughout

Support for PDF is being tracked in the sibling Java library in drewnoakes/metadata-extractor#327. I have no issue with supporting other document types as you suggest.
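To make the two guidelines above concrete, here is a minimal sketch of how document metadata might be mapped onto a directory/tag structure using only the standard library. It is not this library's API: the `Directory` class, the `TAG_*` identifiers, the `build_pdf_directory` helper, and the input dictionary keys are all hypothetical names chosen for illustration, and the sketch assumes the PDF's document information has already been parsed into a plain dictionary.

```python
# Hypothetical illustration only; none of these names come from the library.

class Directory:
    """A named group of tags, each identified by an integer id."""

    def __init__(self, name, tag_names):
        self.name = name
        self._tag_names = dict(tag_names)  # tag id -> display name
        self._values = {}                  # tag id -> stored value

    def set(self, tag_id, value):
        # Only tags declared for this directory may be stored.
        if tag_id not in self._tag_names:
            raise KeyError(f"Unknown tag id {tag_id} for directory {self.name}")
        self._values[tag_id] = value

    def get(self, tag_id):
        # Returns None when the tag was never set.
        return self._values.get(tag_id)

    def tags(self):
        # (display name, string description) pairs, ordered by tag id.
        return [(self._tag_names[t], str(v)) for t, v in sorted(self._values.items())]


# Hypothetical tag ids for a PDF directory.
TAG_TITLE = 1
TAG_AUTHOR = 2
TAG_PAGE_COUNT = 3


def build_pdf_directory(info):
    """Copy known keys from an already-parsed info dictionary into a directory."""
    directory = Directory(
        "PDF",
        {TAG_TITLE: "Title", TAG_AUTHOR: "Author", TAG_PAGE_COUNT: "Page Count"},
    )
    key_to_tag = {"Title": TAG_TITLE, "Author": TAG_AUTHOR, "PageCount": TAG_PAGE_COUNT}
    for key, tag_id in key_to_tag.items():
        if key in info:  # unknown keys are ignored in this sketch
            directory.set(tag_id, info[key])
    return directory


if __name__ == "__main__":
    pdf = build_pdf_directory({"Title": "Report", "Author": "Ann"})
    for name, description in pdf.tags():
        print(f"[{pdf.name}] {name} = {description}")
```

The point of the sketch is that any new file type only needs to express its metadata as named tags inside a directory; how the bytes are parsed is a separate concern and would have to be written without third-party dependencies.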

@oliver021
Author

Okay, I'll make a pull request now. Thanks for responding!

@drewnoakes
Owner

@oliver021 fantastic, thanks.

@drewnoakes changed the title from "It would not be a good idea to include an extractor for other file types like pdf?" to "PDF support" on Jun 7, 2021
@VincentMarnier

Hello,
Is there any status update on this?
I can't find the mentioned pull request.

@oliver021
Author

Hello, I have not been able to do anything about it. My schedule changed drastically, and I now have very little time.

3 participants