Skip to content

ShathaAlghamdi/New-York-City-Taxi-Trip-Duration-T5

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 

Repository files navigation

Proposal: NYC Trip-Duration Prediction

Introduction:

The goal of this statistical analysis is to help us understand the relationship between trip duration and other features in the dataset that affect the trip duration.

Business Problem:

Predect the trip duration time that depends on specific features extracted from the dataset.

Project Management Approach:

  • Scrum

Data description:

  • id: a unique identifier for each trip
  • vendor_id: a code indicating the provider associated with the trip record
  • pickup_datetime: date and time when the meter was engaged
  • dropoff_datetime: date and time when the meter was disengaged
  • passenger_count: the number of passengers in the vehicle (driver entered value)
  • pickup_longitude: the longitude where the meter was engaged
  • pickup_latitude the latitude where the meter was engaged
  • dropoff_longitude: the longitude where the meter was disengaged
  • dropoff_latitude: the latitude where the meter was disengaged
  • store_and_fwd_flag: This flag indicates whether the trip record was held in vehicle memory before sending to the vendor because the vehicle did not have a connection to the server - Y=store and forward; N=not a store and forward trip.
  • trip_duration: duration of the trip in seconds

Size of Data:

  • Number of rows: 1458644
  • Number of columns: 11

Dataset sourec:

from Kaggle website [Kaggle]

Questions:

  1. Trip Duration per Hour and per Day
  2. Is there a relationship between Distance and Trip Duration?
  3. Distance per Hour and per Days
  4. Is Vendor Id have relationships with Trip Duration?
  5. What Vendor have the most Trips?
  6. Is the Passenger number affect the Trip Duration?
  7. Which Days has the highest number of Passengers?
  8. What’s the number of Trips per Hour and per Day?
  9. Features Correlating with Trip Duration

Algorithms:

  • Arima Model

Tools:

Softwares:


  1. VScode
  2. Trello
  3. Jupyter
  4. Github
  5. PowerPoint
  6. Zoom

Languages & Libarry


  1. Python
    • Pandas
    • numpy
    • seaborn
    • plotly
    • sklearn

MVP goals:

  1. Find out what factors affect on the trip duration.
  2. Find how many trip per hour we can reach in one day.
  3. We want to see if there is a specific day that have alot of trafic.

Team Members

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •