


prakass1/Data-Science-Lit


Data-Lit

All data analysis exploration projects are kept here, either as Python Jupyter notebooks or as Python code.

  1. 911 Emergency Dataset analysis: An analysis of the time series data, with visualizations provided as images.

  2. Loan Lending Club: A complete exploratory analysis of the dataset, including various boxplots and seaborn visuals.

  3. Headlines Extraction and analysis: An end-to-end project that extracts the top headlines from newsapi.org and saves them as a CSV file. The CSV is then analyzed to produce word clouds, the distribution of words in headlines, and a clustering of the headlines.

  4. mothers day notebook: A heartfelt project that searches for recent/popular tweets and uses their text to draw a word visualization in the shape of a mother.

  5. Spam Detection on YouTube Comments: This project detects spam in YouTube comment text. The datasets for the individual singers were combined to classify spam versus ham, whereas the original paper evaluates each singer's dataset separately. A new preprocessing step converts smileys in UTF-8 format to a text representation (for example, :) -> Smile), and HashTagCounts and LinkCounts are added as features. Together these boost classifier performance, with the SVM doing best at 93%, which is close to the results of the original paper: Alberto, Túlio C., Johannes V. Lochter, and Tiago A. Almeida. "Tubespam: Comment spam filtering on youtube." 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA). IEEE, 2015.
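
The feature ideas in the spam detection project (smiley-to-text conversion, hashtag counts, link counts) could look roughly like the sketch below. This is an illustration only, not the code from the notebook: the `EMOTICON_WORDS` table, the regular expressions, and the `extract_features` helper are all assumptions made for this example. Links are removed from the text before counting hashtags so that URL fragments such as `#top` are not counted as hashtags.

```python
import re

# Hypothetical emoticon-to-word table; the project's real mapping may differ.
EMOTICON_WORDS = {
    ":)": "smile",
    ":-)": "smile",
    ":(": "sad",
    ":-(": "sad",
    ":D": "laugh",
    ";)": "wink",
}

# Try longer emoticons first so a longer one wins over any shorter prefix.
_EMOTICON_RE = re.compile(
    "|".join(re.escape(e) for e in sorted(EMOTICON_WORDS, key=len, reverse=True))
)
_HASHTAG_RE = re.compile(r"#\w+")
_LINK_RE = re.compile(r"(?:https?://|www\.)\S+", re.IGNORECASE)


def extract_features(comment):
    """Return cleaned text plus hashtag and link counts for one comment."""
    # Count links, then drop them so they do not pollute the other features.
    link_count = len(_LINK_RE.findall(comment))
    without_links = _LINK_RE.sub(" ", comment)

    hashtag_count = len(_HASHTAG_RE.findall(without_links))

    # Replace each emoticon with its word, padded so it stays a separate token.
    text = _EMOTICON_RE.sub(
        lambda m: " " + EMOTICON_WORDS[m.group(0)] + " ", without_links
    )
    text = " ".join(text.split())  # collapse repeated whitespace

    return {"text": text, "hashtag_count": hashtag_count, "link_count": link_count}
```

The cleaned `text` could then be vectorized (for example with a bag-of-words model) and combined with the two counts as input to a classifier such as an SVM.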