Skip to content

J535D165/recordlinkage-annotator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

42 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RecordLinkage ANNOTATOR

RecordLinkage ANNOTATOR is a browser-based user interface for manual labeling of record pairs. Manual labeled or annotated data is useful for training and validation models. The application provides the annotator with a clean and intuitive interface. The annotator labels the record pair as "Match" or "Distinct". The output can be exported and used for training and validation. This application is part of the Python Record Linkage Toolkit, but can be used as a standalone tool.

➡️ ➡️ Check out the hosted version of RecordLinkage ANNOTATOR ⬅️ ⬅️

Want to give it a try? Download our Prison book example file 👮 :suspect:

Review screen of RecordLinkage ANNOTATOR

The hosted version of RecordLinkage ANNOTATOR on Github-pages makes use of client-side Javascript only. The tool doesn't upload data. Not convinced about the safety of your data? That's fine. You can deploy the code yourself locally. Please read the Development instructions.

Create annotation file

RecordLinkage ANNOTATOR requires an annotation file as input. This JSON file contains your record pairs and defines a comparison schema. The Python Record Linkage Toolkit can be used to render such a file. For more information, see the documentation on Manual Labeling.

Version 1 of the schema is open source and can be found here. Examples of annotation files can be found in the examples folder.

Extract results

RecordLinkage ANNOTATOR exports the results of the annotation in an annotation file with the same structure as the input annotation file. This makes it simple to review the annotation or continue labeling the data. The Python Record Linkage Toolkit can be used to read the annotation file and extract the links and distinct pairs. For more information, see the documentation on Manual Labeling.

Development

RecordLinkage ANNOTATOR is a React application. You can develop or deploy the project locally with npm start.

License

BSD 3-Clause License