This project involves using R to analyze movie data, then vocabulary trends for different categories such as age, gender, ethnicity, etc. It uses univariate analysis, multivariate analysis, linear regression, etc. The in-depth writeup can be found here
The original vocabulary data set can be found here
The original movie data set can be found here