Skip to content

Issues: project-lux/data-pipeline

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Assignee
Filter by who’s assigned
Sort

Issues list

Pipeline should enforce valid URIs enhancement New feature to add to the code import An issue with harvesting, loading, importing data or generating internal indexes
#74 opened May 17, 2024 by azaroth42
Detect where LC points to Groups as Occupations bug The code does not behave as expected / designed
#73 opened May 15, 2024 by kkdavis14
Can't cache the last page in an AS in case it changes bug The code does not behave as expected / designed import An issue with harvesting, loading, importing data or generating internal indexes
#69 opened May 10, 2024 by azaroth42
Consider DBPedia as source new-dataset A new dataset to map, transform and use
#68 opened May 9, 2024 by azaroth42
Use super() not explicit parent class function calls enhancement New feature to add to the code
#60 opened Apr 30, 2024 by azaroth42
ORCID needs a null fetcher bug The code does not behave as expected / designed import An issue with harvesting, loading, importing data or generating internal indexes
#57 opened Apr 27, 2024 by azaroth42
If a record has an existing equivalent, should reconciler just trust it? discuss Issue needs discussion, but not necessarily blocked reconciliation An issue related to reconciling entities research An idea to implement and see if it improves the code, but might not
#48 opened Apr 24, 2024 by azaroth42
Detect WD disambig pages and don't allow external refs to be equiv to them reconciliation An issue related to reconciling entities research An idea to implement and see if it improves the code, but might not
#43 opened Apr 22, 2024 by kkdavis14
How to create Activities people participated_in research An idea to implement and see if it improves the code, but might not
#40 opened Apr 18, 2024 by kkdavis14
Test in LC mapper for ConferenceName blocked This issue is blocked by some other issue or dependency mapping An issue related to data mapping/transformation code
#35 opened Apr 12, 2024 by kkdavis14
Investigate voting in merger merging An issue related to merging records together research An idea to implement and see if it improves the code, but might not
#30 opened Mar 27, 2024 by azaroth42
Reduce number of unnecessary computed triples enhancement New feature to add to the code export An issue related to exporting data or loading generated data to external systems (e.g. MarkLogic)
#29 opened Mar 27, 2024 by azaroth42
Make architecture more coherent for data storage enhancement New feature to add to the code reconciliation An issue related to reconciling entities research An idea to implement and see if it improves the code, but might not storage An issue related to managing records in the internal caches
#28 opened Mar 27, 2024 by azaroth42
Prevent links from cross-class reconciled records identity-management An issue related to the idmap, reidentification or reference processing merging An issue related to merging records together reconciliation An issue related to reconciling entities research An idea to implement and see if it improves the code, but might not
#27 opened Mar 27, 2024 by azaroth42
Maintain history of ID merges and make available enhancement New feature to add to the code identity-management An issue related to the idmap, reidentification or reference processing
#26 opened Mar 27, 2024 by azaroth42
Investigate stopping collector after a given distance to reduce overmerges enhancement New feature to add to the code reconciliation An issue related to reconciling entities research An idea to implement and see if it improves the code, but might not
#25 opened Mar 27, 2024 by azaroth42
Ensure that TimeSpans always have botb and eote enhancement New feature to add to the code mapping An issue related to data mapping/transformation code
#22 opened Mar 22, 2024 by azaroth42
Add BNCF as source defer Work on this issue has been deferred until later new-dataset A new dataset to map, transform and use
#17 opened Mar 5, 2024 by kkdavis14
Integrate MarkLogic middleware enhancement New feature to add to the code export An issue related to exporting data or loading generated data to external systems (e.g. MarkLogic)
#16 opened Feb 29, 2024 by azaroth42
Automate the creation of test datasets enhancement New feature to add to the code testing The issue is to do with a test, or test infrastructure
#15 opened Feb 29, 2024 by azaroth42
Consider not reconciling records of different types reconciliation An issue related to reconciling entities research An idea to implement and see if it improves the code, but might not
#14 opened Feb 29, 2024 by azaroth42
Implement incremental update processing challenge This issue is hard! enhancement New feature to add to the code identity-management An issue related to the idmap, reidentification or reference processing import An issue with harvesting, loading, importing data or generating internal indexes
#13 opened Feb 29, 2024 by azaroth42
Japan authority data is almost never reached bug The code does not behave as expected / designed mapping An issue related to data mapping/transformation code
#12 opened Feb 29, 2024 by azaroth42
Investigate GADM as a source defer Work on this issue has been deferred until later new-dataset A new dataset to map, transform and use
#11 opened Feb 29, 2024 by azaroth42
Add kulturnav, SOCH as sources defer Work on this issue has been deferred until later new-dataset A new dataset to map, transform and use
#10 opened Feb 29, 2024 by azaroth42
ProTip! Type g i on any issue or pull request to go back to the issue listing page.