Skip to content

It contains 3 Portfolio projects describing Exploratory analysis in cycling data, data analysis and prediction on COVID-19 data and ML model to predict genre of the books.

Notifications You must be signed in to change notification settings

akchaudhary57/Data-Science-Portfolio

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Science Portfolio

Portfolio 1

Analyzing Cycling data

Dataset:- The data set is an export of Strava and GoldenCheetah, an online social networking site for cycling and other sport.

Plotting and vistualization:- This Portfolio provides summary of cycling data which includes plotting histograms, heatmap, and subplots to represent clear and concise visualization of cycling data.

Portfolio 2

Analyzing COVID-19 data

Dataset:- The data is being made available by Johns Hopkins University in this GitHub.The plotting is performed for the confirmed cases. The portfolio shows visualization and analysis of COVID cases across several countries through line graph. Following feature has also been implemented:-

Running Bar graph:- Top 10 Counties With COVID-19 Cases Per Million YouTube. It is also available in the dataset.

Simple linear model to predict COVID cases particularly in US and China:- It contains simple linear model techniques to predict the log of the number of case across US. Also, the reason for China's data not showing exponential growth and it's act to stop the virus.

Portfolio 3

Predicting the genre of Books

Dataset:- We have use a set of book summeries from the CMU Book Summaries Corpus. It contains a large number of summaries (16,559) and includes meta-data about the genre of the books taken from Freebase.

Model used:- We have used MultiNomial NB model and Logistic Regression to predict the model.

Libraries:-

TfidfVectorizer:- We have used this library to convert a collection of raw documents to a matrix of TF-IDF (Term frequency and Inverse Document Frequency).

About

It contains 3 Portfolio projects describing Exploratory analysis in cycling data, data analysis and prediction on COVID-19 data and ML model to predict genre of the books.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published