GitHub - yash21saraf/Multilingual-Search-Engine: Solr powered full stack project for twitter.

Multilingual Search Engine - CSE-535

All the work is done as the part of the final project for CSE535.

The goal of the project is to implement concepts learnt as a part of the course to create a end to end search engine and deploy it on public server.

Experimentation Data -

Data for this project was collected from Twitter using the Twitter API. In addition a GEO coding API was used in order to derive the country of origin for some tweets. Approximately 2 Million tweets downloaded spread over a month.

Implementation Details-

Solr Backend -
- BM25 Similarity Factory with k1 and b values as 2.4 and 0.2 respectively.
- Deduplication used to eliminate duplicates.
Logs -
- Query logs were stored in Database. The query logs included the query, timestamp, and all the tweet IDs which were returned for that particular query.
- Relevance Logs were also stored in different collection. If user clicks on particular result then that result is considered relevent and can be used to return better results. So tweetID, and query also stored in Database.
Frontend -
- Interactive UI using Javascript, HTML, and Bootstrap to use solr's faceted query.
- Dynamic graphs which interacts with backend on the fly.
- Pagination to manage returned tweets.
Analytics -
- Time Series Analysis - Per day basis time series analysis for entire data, along with filter wise time series analysis.
- Sentiment Analysis - Donut chart representing the average sentiment for the particular query. Time series graphs for sentiment also generated.
- HashTags and Mentions - Extracting top 10 hashtags and mentions for returned results.
- Pie Charts - Pie charts to represent count based analysis for all tweets across different filters.
- Google Highcharts - Google Highcharts to represent region wise count on world map based on origin location.

Please check the detailed version here in report -Multilingual-Search-System - Report

Find the video for description for the same here -

Name		Name	Last commit message	Last commit date
Latest commit History 62 Commits
documents		documents
python files		python files
static		static
templates		templates
.project		.project
Procfile		Procfile
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

documents

documents

python files

python files

static

static

templates

templates

.project

.project

Procfile

Procfile

README.md

README.md

app.py

app.py

requirements.txt

requirements.txt

Repository files navigation

Multilingual Search Engine - CSE-535

Implementation Details-

About

Releases

Packages

Languages

yash21saraf/Multilingual-Search-Engine

Folders and files

Latest commit

History

Repository files navigation

Multilingual Search Engine - CSE-535

Implementation Details-

About

Resources

Stars

Watchers

Forks

Languages