Skip to content

laxmanverma/search-engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

search-engine

It is capable of answering single-word search queries efficiently. The user needs to give a seed page url as input. The seed page( A good seed page is the one with many links embedded in it) can be any valid url which is used by the program to crawl the web. Given the seed page url, the program takes a couple of minutes (usually 15-20 min) depending upon the seed page, to fetch and preprocess(ranking) the data crawled.

On calculating all the results, user can make infinite number of single-word queries and the program displays the top results based on the search. The very famous Page Rank algorithm has been used in the program. The results are sorted in decreasing order of their ranks computed with the ranks displayed alongside.

To clone the repository

$ git clone https://github.com/laxmanverma/search-engine.git

to run the script

$ python search_engine.py

About

It is a simple search engine coded in python

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages