Skip to content

Search Engine Implemented in Python. Components : Web Crawler. Indexer. Parser. Page Ranking Algorithm

Notifications You must be signed in to change notification settings

vigneshnadar/Information-Retrieval

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Information-Retrieval

  1. Each folder has a seperate README file
  2. Each folder contains different components of a limited scope search engine
  3. Web Crawler Bfs Dfs : This component is given a seed URL. Both breadth first search and depth first search algorithms were explored and compared for web crawling
  4. Retrieval Model: Parsing of the crawled html files and generating an index out of them for quick searches
  5. Page Rank Algorithm: Cosine Similarity , TF-IDF and BM25 algorithms were analyzed and compared for page ranking