Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sample Data for ingestion #65

Open
afandian opened this issue Mar 4, 2019 · 0 comments
Open

Sample Data for ingestion #65

afandian opened this issue Mar 4, 2019 · 0 comments
Assignees
Labels
sprint-1 Sprint 1, 2019-03-04 sprint-2 2019-03-18 sprint-3 2019-04-01

Comments

@afandian
Copy link
Member

afandian commented Mar 4, 2019

We have two kinds of test data. The ‘corpus’ data is large but randomly chosen. The regression data is diverse, specific but out of date. Locate metadata that covers a selection of content types to give a reliable cross-section of our features.

Definition of done:

  • All of the work types enumerated in documentation. e.g. about 15 types.
  • All of the supporting input types.
  • For each, there is a good number (e.g. 100) of works that implement relevant features including any quirks.
  • Tests parse, with manually verified that they are correctly represented in regression test suite.
@afandian afandian added the sprint-1 Sprint 1, 2019-03-04 label Mar 4, 2019
@afandian afandian added this to the elastic-search-migration milestone Mar 4, 2019
@afandian afandian removed the sprint-1 Sprint 1, 2019-03-04 label Mar 4, 2019
@afandian afandian added the sprint-1 Sprint 1, 2019-03-04 label Mar 4, 2019
@ppolischuk ppolischuk added the sprint-2 2019-03-18 label Mar 18, 2019
@ppolischuk ppolischuk added the sprint-3 2019-04-01 label Mar 29, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
sprint-1 Sprint 1, 2019-03-04 sprint-2 2019-03-18 sprint-3 2019-04-01
Projects
No open projects
Development

No branches or pull requests

3 participants