Skip to content

AlexHolmes4/Python-PY4E-Capstone

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 

Repository files navigation

PY4E - CAPSTONE PROJECT

Overview

For the final (optional) assignment of the PY4E specalization students were to pick a data source, analyse it and visually represent patterns. I chose flight data from academic torrents . This data was too large to upload to the GitHub repository reliably, but you can download from the source link above as a tsv file.

"High Level Solution Architecture"

flightdatacrawler High Level Overview

  • A python program will run and extract the data into a raw storage database. This first extraction of data is unstructured, the stopping and restarting of data extraction will be incorporated in the python program - SQL query design.
  • The data can be processed, modelled and stored into a second structured relational database where querying is optimized.
  • This data can now visualized using Javascript. The D3 (Data Driven Documents) Javascript Library was chosen for the visualisation.

Solution Architecture - Multi Layered Analysis

flightdatacrawler High Level Solution

Data Model

data model flights

Screenshots of The Process and End Results

First fgather.py was used to retrieve the data source and insert it into a raw unstructured database (rawfdata.sqlite) A method to stop and continue retrieving where the program left off was introduced to handle large data retrieval.

gather

Next the data needed to be restructured, so the raw database was retrieved by fmodel.py and inserted into a new relational database, optimized for data retrieval. Again a method was introduced to allow the program to pick up from where it left off if interruptions occur.

commandline fmodel

fstructured DB

Once the data was modelled it could be analysed, such as flights frequency plotted on a chart, or visualizing connections between cities.

Visualising The Data

fbasic

flight line chart

force lots of data

About

My Capstone project for PY4E

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published