Skip to content

Commit

Permalink
update labelling documentation
Browse files Browse the repository at this point in the history
  • Loading branch information
sonalgoyal committed Jan 2, 2022
1 parent ccc90bd commit 045d0e8
Show file tree
Hide file tree
Showing 4 changed files with 17 additions and 2 deletions.
Binary file added assets/update.gif
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
2 changes: 1 addition & 1 deletion docs/generatingDocumentation.md
Expand Up @@ -3,6 +3,6 @@ layout: default
nav_order: 6
---
## Generating Documentation
Zingg allows generating readable documentation about the training data, including those marked as matches as non matches. The documentation is written to the zinggDir/modelId folder and can be built using the following
Zingg generates readable documentation about the training data, including those marked as matches as non matches. The documentation is written to the zinggDir/modelId folder and can be built using the following

`./scripts/zingg.sh --phase generateDocs --conf <location to conf.json>`
4 changes: 3 additions & 1 deletion docs/stepByStep.md
Expand Up @@ -31,4 +31,6 @@ The training data in Step 4 above is used to train Zingg and build and save the
### Step 6: Voila, lets match!
Its now time to apply the model above on our data. This si done by running the *match* or the *link* phases depending on whether you are matching within a single source or linking multiple sources respectively. You can read more about [matching](setup/match.md) and [linking](setup/linking.md)

As long as your input columns and the field types are not changing, the same model should work and you do not need to build a new model.
As long as your input columns and the field types are not changing, the same model should work and you do not need to build a new model. If you change the match type, you can cotinue to use the training data and add more labelled pairs on top of it.


13 changes: 13 additions & 0 deletions docs/updatingLabels.md
@@ -0,0 +1,13 @@
---
layout: default
nav_order: 6
---
## Updating Labeled Pairs
As our understanding of our data changes, we may need to revisit the previously marked pairs and update them. To do this, please [generate the documentation of the model.](./generatingDocumentation.md)

You can then invoke the updater by invoking
`./scripts/zingg.sh --phase updateLabel --conf <location to conf.json>`

This brings up the console labeller which accepts the cluster id of the pairs you want to update.

![Shows records and asks user to update yes, no, cant say on the cli.](/assets/update.gif)

0 comments on commit 045d0e8

Please sign in to comment.