Skip to content

c-strasser/topic-modeling-nutella

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

topic-modeling-nutella

This project deals with identifying frequent topics expressed by French-speaking consumers on social media in posts about the product 'Nutella'.

The problem of 'topic modeling' has beem tackle with the widely used LDA algorithm.

The LDA algorithm and the process of cleaning the social posts has been coded in utils. These functionalities have been unit-tested.

The analysis and explanation of the project are to be found in a notebook

Dependencies

How to get started

  • After cloning the repo, create a folder 'tree_tagger' next to your copy of the repo
  • Install the tree tagger software in this folder
  • Install the other two dependencies