Sentence Similarity based on Semantic Nets and Corpus Statistics

This is an implementation of the paper "Sentence Similarity Based on Semantic Nets and Corpus Statistics" by Yuhua Li, David McLean, Zuhair A. Bandar, James D. O'Shea, and Keeley Crockett (IEEE Transactions on Knowledge and Data Engineering, 2006).

Sentence similarity is computed as a linear combination of semantic similarity and word-order similarity. Both components are derived from semantic and word-order vectors built for each sentence with the help of WordNet, as sketched below.
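The two component scores follow the paper's definitions: semantic similarity is the cosine of the two semantic vectors, and word-order similarity is 1 − ‖r1 − r2‖ / ‖r1 + r2‖. Below is a minimal sketch of that combination, assuming the semantic vectors (s1, s2) and word-order vectors (r1, r2) have already been built; the weighting factor delta = 0.85 follows the value used in the paper, and the function names here are illustrative, not necessarily those in similarity.py.

```python
import numpy as np

DELTA = 0.85  # weighting factor from the paper; semantic similarity dominates

def semantic_similarity(s1, s2):
    # Cosine of the two semantic vectors.
    return float(np.dot(s1, s2) / (np.linalg.norm(s1) * np.linalg.norm(s2)))

def word_order_similarity(r1, r2):
    # 1 - |r1 - r2| / |r1 + r2|, as defined in the paper.
    return 1.0 - np.linalg.norm(r1 - r2) / np.linalg.norm(r1 + r2)

def sentence_similarity(s1, s2, r1, r2, delta=DELTA):
    # Linear combination of semantic and word-order similarity.
    return delta * semantic_similarity(s1, s2) + (1 - delta) * word_order_similarity(r1, r2)

# Toy example with hand-made vectors:
s1, s2 = np.array([1.0, 0.8, 0.2]), np.array([1.0, 0.6, 0.4])
r1, r2 = np.array([1.0, 2.0, 3.0]), np.array([1.0, 3.0, 2.0])
print(round(sentence_similarity(s1, s2, r1, r2), 3))
```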

Modules Required

math
os
time
sys
numpy
sklearn
nltk

  • from nltk.corpus
    • wordnet
    • brown
    • stopwords
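The three NLTK corpora listed above must be downloaded once before running the code. A minimal setup sketch:

```python
import nltk

# One-time downloads of the corpora listed above; safe to re-run.
nltk.download('wordnet')
nltk.download('brown')
nltk.download('stopwords')

# After downloading, the corpora are importable from nltk.corpus:
from nltk.corpus import wordnet, brown, stopwords
```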

Steps

  1. Download the two main programs, similarity.py and main.py.
  2. Construct the folder sub-structure as shown in the repository screenshot (Capture.png: expected folder layout, including the dataset sub-folder).
  3. similarity.py contains all the core functions and is imported by main.py. Run similarity.py first to make sure there are no errors, then run main.py (see the sketch below).
  4. Put all the documents (text format) to be compared for similarity in the dataset sub-folder.
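As a hypothetical illustration of step 3, the driver loop below reads every text file from the dataset sub-folder and scores each pair. The imported name sentence_similarity and its two-argument signature are assumptions; use whatever function similarity.py actually exposes.

```python
import os
from itertools import combinations

from similarity import sentence_similarity  # function name and signature assumed, not verified

DATASET_DIR = 'dataset'  # the sub-folder described in step 4

# Read every plain-text document in the dataset sub-folder.
docs = {}
for fname in sorted(os.listdir(DATASET_DIR)):
    if fname.endswith('.txt'):
        with open(os.path.join(DATASET_DIR, fname), encoding='utf-8') as f:
            docs[fname] = f.read()

# Score every pair of documents.
for (name_a, text_a), (name_b, text_b) in combinations(docs.items(), 2):
    print(f'{name_a} vs {name_b}: {sentence_similarity(text_a, text_b):.3f}')
```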
