Changing the regex pattern for incoming PDFs #9

Open
skorasaurus opened this issue Sep 15, 2017 · 2 comments

Comments

@skorasaurus
Member

skorasaurus commented Sep 15, 2017

While running the extractor on my setup, I noticed that it skipped all of the PDFs I had added, because the regex pattern for filenames only accepts PDFs named like those already in the repo.

Should the pattern change or should we rename the files instead?
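
To help weigh the two options, here is a minimal sketch of how one might list which filenames a given pattern would skip. The function name `split_by_pattern` and the example pattern are hypothetical; this is not the extractor's actual code or regex.

```python
import re


def split_by_pattern(filenames, pattern):
    """Split filenames into (accepted, skipped) lists.

    `pattern` is any regex string; a name is accepted only if the
    whole filename matches it.
    """
    compiled = re.compile(pattern)
    accepted, skipped = [], []
    for name in filenames:
        if compiled.fullmatch(name):
            accepted.append(name)
        else:
            skipped.append(name)
    return accepted, skipped


# Hypothetical pattern and filenames, for illustration only.
accepted, skipped = split_by_pattern(
    ["report_1.pdf", "2017-09-15.pdf"],
    r"report_\d+\.pdf",
)
```

If most new files end up in `skipped`, changing the pattern is probably less work than renaming every incoming file.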

@skorasaurus
Member Author

This is the first step toward getting a new instance up.

@skorasaurus
Member Author

skorasaurus commented Jun 30, 2019

For consistency's sake, we had previously discussed using the ISO 8601 (YYYY-MM-DD) date pattern, as we did at https://github.com/opencleveland/drocer although this was

I rewrote the regex pattern in https://github.com/skorasaurus/drocer-webapp/tree/fix-9-regex-input
so that we can index PDFs using the ISO 8601 pattern,
and I am now double-checking the code to see whether this change has any other consequences.

The only noticeable consequence is when viewing results in the left-side window ( ); the formatting is
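
For reference, a minimal sketch of what matching ISO 8601 filenames could look like. It assumes the filename is just the date plus a `.pdf` extension (for example `2019-06-30.pdf`); the names `ISO_PDF_PATTERN` and `parse_iso_pdf_name` are hypothetical, and the pattern in the branch above may well differ.

```python
import re
from datetime import date

# Assumed filename shape: YYYY-MM-DD.pdf, nothing before or after the date.
ISO_PDF_PATTERN = re.compile(r"(\d{4})-(\d{2})-(\d{2})\.pdf", re.IGNORECASE)


def parse_iso_pdf_name(filename):
    """Return the date encoded in a YYYY-MM-DD.pdf filename, or None."""
    match = ISO_PDF_PATTERN.fullmatch(filename)
    if match is None:
        return None
    year, month, day = (int(part) for part in match.groups())
    try:
        # date() rejects impossible values such as month 13 or day 32.
        return date(year, month, day)
    except ValueError:
        return None
```

Validating with `date()` after the regex match means a name like `2019-13-01.pdf` is rejected even though it fits the digit pattern.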
