Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data sync #3

Open
gondzo opened this issue Dec 20, 2017 · 1 comment
Open

Data sync #3

gondzo opened this issue Dec 20, 2017 · 1 comment

Comments

@gondzo
Copy link
Collaborator

gondzo commented Dec 20, 2017

Since we need to support data sync with every new release of the data sets, we have some issues in tracking veteran records - there is no ID field for veterans or cemeteries, so we are using a combination of fields as an ID. right now those fields are {firstName-lastName-birthDate-deathDate-cemeteryID} as veteran ID, with Cemetery ID: {cemeteryName-address-zip}.

This ensures we populate the database only with complete data. If we don't use birth/death date or cemetery id as keys, we get a lot of duplicate records.
On the other hand, this does mean we will miss a lot of records that have partial data.
Is there anything that can be done about the data source (like having record ids)? @kbowerma

@kbowerma
Copy link
Contributor

Yeah, I think we need to figure out a way to create a key from the row, to ensure uniqueness as we insert. The problem with that is that if a record is edited from the source, it will be a new key.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants