Skip to content

Time series analysis and regression model to predict the number of taxi rides ordered in the next hour.

Notifications You must be signed in to change notification settings

adkwn1/time_series_analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

Time Series Analysis

Author: Andrew Kwon

Description

This project conducts a time series analysis of hourly taxi order volume over a period of six months. The main tasks of the project are as follows:

  • Identify trends and/or seasonality
  • Engineer new features based on the results of the time series analysis
  • Scale/normalize training data
  • Train and evaluate a prediction model for the given regression task

Introduction

We will train and evaluate a regression model for a taxi company that will predict the amount of taxi orders needed at airports in the next hour. The RMSE metric will be used for the test evaluation.

Dataset

taxi.csv

  • datetime: date (YYYY-MM-DD) and time (HH:MM:SS)
  • num_orders: number of orders

Requirements

  • pandas
  • matplotlib.pyplot
  • sklearn.linear_model
  • sklearn.tree
  • sklearn.ensemble
  • sklearn.preprocessing
  • sklearn.compose
  • sklearn.pipeline
  • sklearn.model_selection
  • sklearn.metrics
  • statsmodels.tsa.seasonal
  • statsmodels.graphics.tsaplots

Screenshots

f568e1d4-b18b-4cde-8c2d-51392a04de98 a4de3aa0-b032-4a89-8f08-f0b5cfc64b48 28983c4d-87a6-409d-968b-39883b5e55f7 4e85eabe-b181-4385-af11-6cb3017ca852

Releases

No releases published

Packages

No packages published