Train a model that classifies very unstructured text like Reddit comments to see if a recommendation engine could be built.
If subreddits cluster together, nearest clusters may be other subreddits that are "related" or "interesting" to a user.
Please view the final report PDF located in this repository @ martin-790-final-report.pdf
Kaggle for uploading the data set and great tutorials on Python's gensim and sklearn libraries