Skip to content
#

streamming

Here are 8 public repositories matching this topic...

An application that gathers streaming data from twitter dataset , uses feature selection techniques to extract relevant keywords from a tweet , performs clustering to detect events . It also takes input from user such that window specified by user allows to further perform clustering which in turn uses a forest data structure to achieve it.

  • Updated Mar 14, 2018
  • Jupyter Notebook

Extracted sample Tweets and their external links to detect fake news by using “kaggle’s fake news dataset”, PySpark, NLTK Library by TFIDF, ARI (Automated Readability Index), Syntactic POS, ORS (Online Relevance Score) , LDA, Word2Vec and 16 more NLP features. These features were Classified using Random Forest and Ada-Boosting.

  • Updated Dec 15, 2017
  • Python

Improve this page

Add a description, image, and links to the streamming topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the streamming topic, visit your repo's landing page and select "manage topics."

Learn more