bigdataproject

Analyzing Yelp reviews using topic modeling and aspect mining

Dependencies for the project:

The project in this repository was run successfully on Python 2.7 and Spark 1.6.

Install NLTK for stemming. To download the NLTK data, run nltk.download() in the pyspark shell, type d (for download) at the prompt, and then type all to download all NLTK packages.
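As a rough sketch of the stemming step, the snippet below reduces each word of a review to its stem with NLTK. The choice of the Porter stemmer, the whitespace tokenization, and the helper name stem_review are assumptions made for illustration; the code in this repository may use a different stemmer or tokenizer.

```python
from nltk.stem.porter import PorterStemmer

# Assumption: the Porter stemmer; the project may use another NLTK stemmer.
stemmer = PorterStemmer()


def stem_review(text):
    """Lowercase a review, split it on whitespace, and stem each token."""
    return [stemmer.stem(token) for token in text.lower().split()]


# Hypothetical review text, used only to show the call.
print(stem_review("The waiters were serving amazing burgers"))
```

The Porter stemmer itself needs no downloaded corpora, so this part works as soon as the nltk package is installed.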

Run LDA to extract the aspects from the reviews.
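One possible way to run the LDA step on Spark 1.6 is through pyspark.mllib.clustering.LDA. The sketch below assumes the reviews are already tokenized and stemmed; the helper names (build_vocabulary, to_count_vector, train_lda) and the parameter values are hypothetical and are not taken from this repository.

```python
def build_vocabulary(tokenized_reviews):
    """Map each distinct token to a column index (sorted for determinism)."""
    vocabulary = sorted({tok for review in tokenized_reviews for tok in review})
    return {tok: index for index, tok in enumerate(vocabulary)}


def to_count_vector(tokens, vocabulary):
    """Turn one tokenized review into a dense list of term counts."""
    counts = [0.0] * len(vocabulary)
    for tok in tokens:
        if tok in vocabulary:
            counts[vocabulary[tok]] += 1.0
    return counts


def train_lda(sc, tokenized_reviews, num_topics=2, max_terms=3):
    """Fit LDA on the reviews and return the top terms of each topic."""
    # Imported here so the helpers above work without a Spark installation.
    from pyspark.mllib.clustering import LDA
    from pyspark.mllib.linalg import Vectors

    vocabulary = build_vocabulary(tokenized_reviews)
    index_to_term = {index: tok for tok, index in vocabulary.items()}
    # LDA.train expects an RDD of [document id, term-count vector] pairs.
    corpus = sc.parallelize([
        [doc_id, Vectors.dense(to_count_vector(tokens, vocabulary))]
        for doc_id, tokens in enumerate(tokenized_reviews)
    ])
    model = LDA.train(corpus, k=num_topics, seed=1)
    # describeTopics returns (term indices, term weights) for each topic.
    return [
        [index_to_term[i] for i in term_indices]
        for term_indices, _weights in model.describeTopics(max_terms)
    ]
```

In the pyspark shell, where a SparkContext is already available as sc, this could be called as train_lda(sc, [["food", "tasti"], ["servic", "slow"]]). The top terms of each topic can then be read as candidate aspects.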

Install TextBlob to find the polarity of your aspects

pip install -U textblob

The summary of our analysis can be found at

https://github.com/nhemanthrao23/bigdataproject/blob/master/Analysis%20Summary.md

Presentation of our project

https://prezi.com/p/wkcbwtuwy3hf/

methodology