Skip to content

The goal of this project is to perform an Explorartory Data Analysis with visualization and use a liner regression model to predict the number of bikes rented.

Notifications You must be signed in to change notification settings

isra-st/London_Bike_Sharing

Repository files navigation

London Bicycles Sharing

Goal of the project

  1. Perform an Explorartory Data Analysis with visualization and
  2. Use a liner regression model (OLS) to predict the number of bikes rented.

Tools Used

  • Pandas
  • Numpy
  • Matplotlib
  • Seaborn
  • Time
  • scikit-learn
  • statsmodels
  • Trello
  • MIRO

Resources

London bike sharing dataset https://www.kaggle.com/hmavrodiev/london-bike-sharing-dataset

OLS regression model performance

Train R^2: 0.628 - Train Adjusted R^2: 0.628

Test R^2: 0.655 - Test R^2: 0.582

  • Difference in R^2 between train and test 2.07%
  • Difference in R^2 between train and test is 4.6% which is less than 5%.

Test_VS_Prediction Test_VS_Prediction

Process

  1. Design:
  1. Clean, manipulate and create the visualizations.
  • Exploratory Data Analysis
  • Create visualizations
  • Create Dummies
  • Recursive Feature Selection (RFE)
  1. Create the linear regression model.
  • Validate the assumptions (Linearity, Autocorrelation, Sub-Normality, Normality, Multicollinearity)
  1. Analyze the model perforamnce
  2. Creating a Story Telling presentation

Presentation

To see the presentation, click in the below picture.

Test_VS_Prediction

About

The goal of this project is to perform an Explorartory Data Analysis with visualization and use a liner regression model to predict the number of bikes rented.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published