tweichle/Classifying-Criminal-Offenses

Classifying-Criminal-Offenses

Classification Application in Python Using scikit-learn

(A PySpark program is included to demonstrate data exploration, data manipulation, and descriptive statistics.)

This repository predicts more serious crimes (index offenses) using Chicago crime data accessed through the Google BigQuery Storage API.

Goals

  • Summarize and examine crime statistics using Chicago Police Department crime data from 2001 to the present.

  • Build and train classification models to predict index offenses (more serious crimes).

    • Compare performance of various classification techniques including logistic regression, random forests, support vector machines, and XGBoost.
    • Apply regularization and cross-validation techniques for model evaluation, selection, and optimization.
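The model comparison described in the goals above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: it uses synthetic data from `make_classification` in place of the Chicago crime data, the hyperparameter values are arbitrary assumptions, and scikit-learn's `GradientBoostingClassifier` stands in for XGBoost so that the example depends only on scikit-learn.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic, mildly imbalanced binary data (assumption: stands in for
# "index offense" vs. "non-index offense" labels from the real dataset).
X, y = make_classification(
    n_samples=400, n_features=10, n_informative=5,
    weights=[0.7, 0.3], random_state=0,
)

# Candidate classifiers. C is the inverse regularization strength for
# logistic regression and the SVM; scaling is applied inside the pipeline
# so it is refit on each training fold.
models = {
    "logistic_regression": make_pipeline(
        StandardScaler(), LogisticRegression(C=1.0, max_iter=1000)
    ),
    "random_forest": RandomForestClassifier(n_estimators=50, random_state=0),
    "svm": make_pipeline(StandardScaler(), SVC(C=1.0, kernel="rbf")),
    # Stand-in for XGBoost (a different gradient boosting implementation).
    "gradient_boosting": GradientBoostingClassifier(n_estimators=50, random_state=0),
}

# Stratified 5-fold cross-validation keeps the class ratio in every fold.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
scores = {
    name: cross_val_score(model, X, y, cv=cv, scoring="f1").mean()
    for name, model in models.items()
}

# Report models from best to worst mean F1 score.
for name, score in sorted(scores.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: mean F1 = {score:.3f}")
```

F1 is used here as the scoring metric only because the synthetic classes are imbalanced; the metric the project actually reports may differ. Tuning `C` or the tree parameters could be layered on top with `GridSearchCV`, using the same `cv` object.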