David Freitag
5/13/21
CIS 9440 - Data Warehousing and Analystics
Project Objective: Follow the Kimball Lifecycle to design and develop a public, cloud-based Data Warehouse with a functioning BI application.
Project Tools: The tools used to build this Data Warehouse were:
- Data integration (ETL): Python
- Data Warehousing: Google BigQuery
- Business Intelligence: Tableau
Motivation for project: The ability to predict changes in the crime rate, as well as where those changes will occur, can lead to more effective and efficient deployment of police and security resources to discourage criminal activity.
Description of the issues or opportunities the project will address:
Issues presented by crime:
- The crime rate in Chicago results in significant financial cost to taxpayers as a result of the police department budget.
- Crime in Chicago has an economic cost on crime victims (i.e. theft of property).
- Crime in Chicago has a human cost on victims of crime, the families of those victims, and the families of the perpetrators.
- Crime in Chicago has a human and economic cost for perpetrators who find it challenging to join the job market after their interactions with the criminal justice system.
Opportunities that arise from this project:
- Reduce crime by more effectively deploying police resources to discourage criminal activity.
- Reduce costs of policing through a lower crime rate.
- Reduce the number of people interacting with the criminal justice system.
- Reduce recidivism of past offenders.
- Increased effectiveness of police activities.
- Increased efficiency of use of police resources.
Project Business or Organization Value: This project will allow the Chicago Police Department to make informed decisions about where and how to deploy police resources, resulting in a significantly reduced crime rate and the costs associated with that crime.
Data Sources:
List of Data Warehouse KPI's:
- Percentage of crimes resulting in arrest by location over a time period
- Total crimes compared to daily temperature over time period
- Number of crimes by location over a time period
- Total crimes compared to graduation rate over a time period
- Total crimes compared to unemployment rate over a time period
This project's Dimensional Model consists of four facts and three dimensions.
This project's Kimball Bus Matrix:
List of Visualizations for each KPI:
- Treemap: percentage of crimes resulting in arrest by location over a time period
- Comparative line plot: total crimes compared to daily temperature over time period
- Navigable map with volume bubbles: number of crimes by location over a time period
- Comparative bar graph/line plot: total crimes compared to graduation rate over a time period
- Comparative line plot with trend lines: total crimes compared to unemployment rate over a time period
Picture of the final BI Dashboard:
The project was deployed on Tableau Public: https://tabsoft.co/3xQjZ9s