md-k-sarker/NaiveSearchEngine

Naive Search Engine

This project illustrates the basic functionality of a search engine: indexing, query analysis, and searching.

Indexing

First, the raw text needs to be analyzed:

  1. Case Folding
  2. Stop Word Removal
  3. Stemming
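
The three analysis steps above can be sketched in plain Python. This is a minimal illustration, not the project's Lucene analyzer: the stop-word list, the suffix list, and the names `stem` and `analyze` are all assumptions, and the suffix stripping is a crude stand-in for a real stemmer such as Porter's.

```python
import re

# Hypothetical stop-word list; a real analyzer ships a much larger one.
STOP_WORDS = {"a", "an", "and", "is", "of", "the", "to"}

# Naive suffix list, standing in for a real stemming algorithm.
SUFFIXES = ("ing", "ed", "es", "s")


def stem(token):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def analyze(text):
    """Case-fold, drop stop words, then stem each remaining token."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())      # 1. case folding
    tokens = [t for t in tokens if t not in STOP_WORDS]  # 2. stop-word removal
    return [stem(t) for t in tokens]                     # 3. stemming


print(analyze("The Engine is Indexing the Documents"))
```

The order matters in this sketch: stop words are removed before stemming so that the stop-word list can be written in unstemmed form.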

After analysis, different fields are given different weights.
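
One way to picture field weighting is a score that multiplies each field's term frequency by a per-field weight. The sketch below is an assumption-laden illustration: the field names `title` and `body`, the weights, and the function `score` are hypothetical, and Lucene's own scoring is considerably more involved.

```python
# Hypothetical field weights: a title match counts twice as much as a body match.
FIELD_WEIGHTS = {"title": 2.0, "body": 1.0}


def score(document, term):
    """Sum weight * term frequency over the weighted fields of one document."""
    total = 0.0
    for field, weight in FIELD_WEIGHTS.items():
        tokens = document.get(field, "").lower().split()
        total += weight * tokens.count(term)
    return total


doc = {"title": "Lucene search", "body": "search engines use an index for search"}
print(score(doc, "search"))  # one title hit (2.0) plus two body hits (2 * 1.0)
```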

Query Optimization

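Query optimization is highly important. The query needs to be built using the same analyzer that was used to build the index; otherwise the query terms will not match the indexed terms.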
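
The effect of analyzing the query with the same analyzer can be shown with a small sketch. The analyzer here only lowercases and drops a few stop words; it and the names `analyze` and `matches` are assumptions for illustration, not the project's code.

```python
# Hypothetical stop-word list shared by indexing and querying.
STOP_WORDS = {"a", "of", "the"}


def analyze(text):
    """Lowercase the text and drop stop words."""
    return [t for t in text.lower().split() if t not in STOP_WORDS]


# Terms stored in the index have already been through the analyzer.
indexed_terms = set(analyze("The History of Search Engines"))


def matches(query_terms):
    """Return the query terms that are present in the index."""
    return [t for t in query_terms if t in indexed_terms]


raw_query = "History of SEARCH".split()    # not analyzed
analyzed_query = analyze("History of SEARCH")  # same analyzer as the index

print(matches(raw_query))       # nothing matches: case differs from the index
print(matches(analyzed_query))  # both content terms match
```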

Searching on existing Index

Returns the documents that match the query.
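
Searching an existing index can be sketched with a small in-memory inverted index. This is a simplified stand-in for what Lucene does, under stated assumptions: tokens are split on whitespace, a document matches only if it contains every query term, and the names `build_index` and `search` are hypothetical.

```python
from collections import defaultdict


def build_index(documents):
    """Map each lowercased token to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for token in text.lower().split():
            index[token].add(doc_id)
    return index


def search(index, query):
    """Return the sorted ids of documents that contain every query term."""
    terms = query.lower().split()
    if not terms:
        return []
    postings = [index.get(term, set()) for term in terms]
    return sorted(set.intersection(*postings))


documents = {
    1: "lucene indexing basics",
    2: "searching with lucene",
    3: "cooking basics",
}
index = build_index(documents)
print(search(index, "lucene basics"))
```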

Built on top of the Lucene API.

Input: Input is taken from the user interface. There is a text box where the user can enter a query.

Output: Output is shown in the user interface and in the console. Output is also available as a text file (.txt).

Various statistics are also available, such as index building time, searching time, and document scores.
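
Timing statistics such as index building time and searching time can be collected by wrapping each operation with a timer. The helper below is a minimal sketch; the name `timed` is hypothetical, and the calls it wraps are placeholders for the real indexing and searching steps.

```python
import time


def timed(func, *args):
    """Call func(*args) and return its result with the elapsed seconds."""
    start = time.perf_counter()
    result = func(*args)
    elapsed = time.perf_counter() - start
    return result, elapsed


# Placeholder workload standing in for building an index.
result, seconds = timed(sorted, ["search", "index", "query"])
print(f"Finished in {seconds:.6f} s: {result}")
```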

This was done as a class project for Information Retrieval (CS:7800).

About

Indexing and searching operations of a search engine
