ML model to remove deduplication of records
Requirements:
- Python 2
- Jupyter Notebook
- numpy
- dedupe
-
Make sure you have Jupyter Notebook installed. If not, do it by typing
pip install jupyter
in your console. -
Install numpy and dedupe by typing
pip install numpy
andpip install dedupe
respectively in console. -
Navigate to the project folder.
-
Open the .ipynb file by typing
jupyter notebook .\Deduplication.ipynb
in your console.
The notebook will open on a new browser. The code along with output and explanation will be present in the notebook.