GitHub - md-k-sarker/NaiveSearchEngine: Indexing and Searching operation of Search Engine

Naive Search Engine

This illustrates the basic functionality of a Search Engine

Indexing

First raw text need to be analyzed.

Case Folding
Remove StopWords
Stemming

After analyzing done different field has given different weightage.

Query Optimization

Query Optimization is highly important. Query need to be build using same Analyzer

Searching on existing Index

Return the matching documents

Built on top of Lucene API

Input: It takes input from User Interface. There is a textbox where user can give query.

Output: Output is shown on User Interface and in Console. Output is also available in text file format(.txt).

Various statistics is also available. Like Index Building Time, Searching Time, Document score etc

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
src/main		src/main
.gitignore		.gitignore
README.md		README.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

src/main

src/main

.gitignore

.gitignore

README.md

README.md

pom.xml

pom.xml

Repository files navigation

Naive Search Engine

This illustrates the basic functionality of a Search Engine

Indexing

Query Optimization

Searching on existing Index

This is done for class project. Information Retrieval(CS:7800)

About

Releases

Packages

Languages

md-k-sarker/NaiveSearchEngine

Folders and files

Latest commit

History

Repository files navigation

Naive Search Engine

This illustrates the basic functionality of a Search Engine

Indexing

Query Optimization

Searching on existing Index

This is done for class project. Information Retrieval(CS:7800)

About

Resources

Stars

Watchers

Forks

Languages