CSMCaen


Advanced Digital Documents (Document Numérique Avancé), Master DNR2i, 2nd year, Semester 1, Department of Computer Science, Prof. Dr. Marc Spaniol

Project: “Social Media Sentiment Monitor” [about SM Caen]

Description:

The project consists of two (sub-)tasks. In a first step, social Web contents about SM Caen need to be collected/crawled. These contents (at least 1,000 documents per match day) should cover at least three matches and should be acquired, e.g., from tweets containing an associated hashtag (#). For further analytics, the contents should be parsed and the “meaningful” text extracted in order to “identify” the overall sentiment (or the sentiments in different sub-communities).
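
As an illustration of what extracting the “meaningful” text from a tweet could look like, here is a minimal Python sketch. The function name `extract_text` and the regular expressions are assumptions made for this example; they are not taken from the repository's crawlers, which may clean the text differently.

```python
import re

# Hypothetical helper for illustration only; not the repository's code.
RETWEET_RE = re.compile(r"^RT\s+@\w+:\s*")  # leading "RT @user: " prefix
URL_RE = re.compile(r"https?://\S+")        # links
MENTION_RE = re.compile(r"@\w+")            # @mentions


def extract_text(tweet: str) -> str:
    """Return the tweet without retweet prefix, URLs and @mentions."""
    text = RETWEET_RE.sub("", tweet)
    text = URL_RE.sub("", text)
    text = MENTION_RE.sub("", text)
    text = text.replace("#", "")   # keep the hashtag word, drop the symbol
    return " ".join(text.split())  # collapse repeated whitespace


if __name__ == "__main__":
    print(extract_text("RT @SMCaen: Quel match ! #SMCaen https://t.co/abc"))
```

Keeping the hashtag word while dropping the `#` symbol is a design choice: the word often carries meaning for the sentiment step, whereas the symbol does not.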

In a second step, a Web-based search interface needs to be developed. This interface should at least support temporal search and analytics for three match days (before and after). To this end, the analytics module should be capable of visualizing the overall sentiment and/or retrieving the relevant documents, terms and/or entities associated with positive/negative sentiments. Enhancements include, but are not limited to, the identification of the most important entities, linking and/or incorporating match statistics, as well as trend analytics.
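
The “before and after” analytics described above can be sketched as an average sentiment per day in a window around a match day. This is only a sketch under stated assumptions: the function `daily_sentiment`, the `(day, score)` input shape and the score scale of -1 to 1 are hypothetical and do not describe the repository's database schema or website code.

```python
from collections import defaultdict
from datetime import date, timedelta


def daily_sentiment(docs, match_day, window=1):
    """Average sentiment per day for match_day +/- `window` days.

    docs: iterable of (day, score) pairs; score assumed to lie in [-1, 1].
    Days without any document are simply absent from the result.
    """
    first = match_day - timedelta(days=window)
    last = match_day + timedelta(days=window)
    buckets = defaultdict(list)
    for day, score in docs:
        if first <= day <= last:
            buckets[day].append(score)
    return {day: sum(s) / len(s) for day, s in sorted(buckets.items())}


if __name__ == "__main__":
    # Made-up sample values, for demonstration only.
    sample = [
        (date(2017, 1, 13), 0.5),
        (date(2017, 1, 14), -1.0),
        (date(2017, 1, 14), 0.0),
        (date(2017, 1, 20), 1.0),  # outside the window, ignored
    ]
    print(daily_sentiment(sample, match_day=date(2017, 1, 14)))
```

A result of this shape (one value per day) maps directly onto a line or bar chart for the visualization part of the interface.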

Specification:

The software you are supposed to develop should be deployed on the Web as a “Web search engine”. The interface needs to be productive (fast, usable and “appealing”). The minimum functionality specified above needs to be available, while additional “features” are a plus. The choice of software is up to you, but any third-party software or libraries used must be open-source.



Technologies, libraries and APIs

Prerequisites

  • Composer
  • Node.js
  • npm
  • pip
    • twitter
    • python-dateutil
    • mysql-connector

Usage:

Install

Clone the GitHub repo:
git clone https://github.com/yboyer/csmcaen
cd csmcaen
Install dependencies:
cd website
npm i

Fill Database

Import database.sql into MySQL.

Crawling:

Edit the start_date and end_date variables in facebookCrawler.py and twitterCrawler.py to set the period to crawl, then run the crawlers:

python twitterCrawler.py
python facebookCrawler.py
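
For readers unfamiliar with how a date window such as start_date and end_date typically restricts a crawl, here is a minimal sketch. Only the two variable names come from this README; their type, the sample values and the helper `in_range` are assumptions, and the actual crawlers may store and compare dates differently.

```python
from datetime import datetime

# Assumed representation; check the crawler scripts for the real format.
start_date = datetime(2017, 1, 13)
end_date = datetime(2017, 1, 15)


def in_range(created_at: datetime) -> bool:
    """True if a timestamp falls inside [start_date, end_date)."""
    return start_date <= created_at < end_date


if __name__ == "__main__":
    print(in_range(datetime(2017, 1, 14, 20, 45)))  # inside the window
    print(in_range(datetime(2017, 1, 15)))          # end is exclusive here
```

Treating the end as exclusive is one possible convention; it avoids counting a document twice when consecutive windows are crawled.
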
Sentiment analysis:

Update the database records with the results of the sentiment analysis:

python sentimentTwitter.py
python sentimentfacebook.py
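
This README does not describe how the sentiment scripts compute their scores. As a rough idea of what a sentiment step can look like, the following sketch uses a simple word-list (lexicon) approach. It is a stand-in technique chosen for illustration, not the method used by sentimentTwitter.py or sentimentfacebook.py; the word lists and the function `score` are hypothetical.

```python
# Tiny illustrative lexicon (hypothetical, not from the repository).
POSITIVE = {"bravo", "victoire", "super", "magnifique"}
NEGATIVE = {"nul", "defaite", "honte", "catastrophe"}


def score(text: str) -> float:
    """Return (positives - negatives) / matched words, in [-1, 1].

    Texts without any lexicon word get a neutral score of 0.0.
    """
    words = [w.strip(".,!?;:").lower() for w in text.split()]
    pos = sum(w in POSITIVE for w in words)
    neg = sum(w in NEGATIVE for w in words)
    total = pos + neg
    return 0.0 if total == 0 else (pos - neg) / total


if __name__ == "__main__":
    for sample in ("Bravo, super victoire !", "Quelle honte", "Match ce soir"):
        print(sample, "->", score(sample))
```

A lexicon approach is easy to inspect and needs no training data, but it ignores negation and irony, so it should be read as a baseline only.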