Similarity between two documents.
-
Updated
Aug 6, 2022 - Python
Similarity between two documents.
was curious about how plagiarism checker works, ended up learning about something completely different 😂
Information Retrieval Lab
Document similarity using cosine distance, tf-idf, and latent semantic analysis.
NASA space apps 2022 local winner (Cairo). This project is the solution designed for the NASA space apps challenge hackathon 2022 by team NASART solving challenge: The Art in Our Worlds.
The framework that finds a perfect job match for you provided through scraped data from indeed.co.uk.
Assessing MinHash LSH for text similarity. Compares with kNN using BART embeddings as ground truth. Involves data preprocessing, shingle creation, LSH experiments. Findings inform LSH's efficiency in document similarity tasks, enhancing understanding of LSH techniques.
Q3 of Final Project Assignment of the course 'Foundations of Data Science' @ CBS
Big data homework solutions
Use of word embeddings and document similarity to solve word analogy problems
This movie recommendation system is designed to provide users with movie recommendations based on the similarity between movies. The system utilizes cosine similarity to identify movies that are closely related in terms of their features, allowing users to discover similar movies based on their preferences.
Given a set of documents and the minimum required similarity threshold find the number of document pairs that exceed the threshold
A PoC on document comparison using various methods in NLP
NLP on American workplace comedy TV pilot transcripts using multiple NLP libraries in Python.
document similarity using Spacy
NLP Projects
a search engine for Pubmed artitcal
NLP projects to understand and practice new concepts
Add a description, image, and links to the document-similarity topic page so that developers can more easily learn about it.
To associate your repository with the document-similarity topic, visit your repo's landing page and select "manage topics."