sohamM97/deduplication

deduplication

ML model for deduplicating records

Requirements:

  • Python 2
  • Jupyter Notebook
  • numpy
  • dedupe

Instructions:

  1. Make sure you have Jupyter Notebook installed. If not, install it by running pip install jupyter in your console.

  2. Install numpy and dedupe by running pip install numpy and pip install dedupe in your console.

  3. Navigate to the project folder.

  4. Open the notebook by running jupyter notebook .\Deduplication.ipynb in your console.

The notebook will open in a new browser tab. The code, along with its output and explanation, is in the notebook.
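Before launching the notebook, it can help to confirm that the packages from step 2 were installed into the interpreter you are using. The snippet below is a minimal sketch and not part of this repository: the helper name missing_packages is hypothetical, and it assumes the import names match the pip package names (numpy, dedupe). It sticks to syntax that runs on Python 2.7 as well as Python 3.

```python
import importlib

# Assumption: the import names are the same as the pip package names
# listed under Requirements.
REQUIRED = ["numpy", "dedupe"]


def missing_packages(names):
    """Return the names from `names` that cannot be imported."""
    missing = []
    for name in names:
        try:
            importlib.import_module(name)
        except ImportError:
            # The package is not installed for this interpreter.
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages: " + ", ".join(missing))
    else:
        print("All required packages are importable.")
```

If any package is reported as missing, rerun the matching pip install command from step 2 with the same Python installation that Jupyter uses.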
