Data-Ingestion-and-Analysis-using-UK-Police-Crime-API

Hello! Please refer to the MDA (Modern Data Architectures) PDF.

This lab covers batch data ingestion using the OSBDET environment, a virtual machine project that packages many open source big data projects such as Hadoop, Spark, Kafka and NiFi: https://github.com/raulmarinperez/osbdet

Here we use the .xml configuration file as a NiFi template to gather data from the free UK Police Crime REST API.
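
For orientation, here is a minimal sketch of the kind of request the flow sends to the API. It assumes the street-level crimes endpoint of data.police.uk; the endpoints and parameters actually configured in the .xml template may differ. The helper name `build_crime_url` and the example coordinates are illustrative only.

```python
from urllib.parse import urlencode

# Street-level crimes endpoint of the UK Police API (assumed here; the
# NiFi template in this repo may query a different endpoint).
BASE_URL = "https://data.police.uk/api/crimes-street/all-crime"


def build_crime_url(lat, lng, month):
    """Build a request URL for crimes near a point in a given month (YYYY-MM)."""
    query = urlencode({"lat": lat, "lng": lng, "date": month})
    return f"{BASE_URL}?{query}"


# Hypothetical example: central London, January 2023.
url = build_crime_url(51.5074, -0.1278, "2023-01")
print(url)
```

An HTTP GET on a URL of this form returns a JSON array of crime records, which is what the NiFi flow writes to HDFS.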

Methodology

  1. Import the .xml configuration template file into NiFi
  2. Create an HDFS directory to save the files
  3. Run the NiFi flow to gather the data
  4. Run the .ipynb notebook to see the data being analysed.
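
Steps 2 and 4 can be sketched roughly as follows. This is not the notebook's code: the notebook uses PySpark and Pandas, while this sketch uses only the standard library so it runs anywhere. The HDFS path, the helper names and the sample records are all hypothetical; the `category` field follows the shape of the API's street-level crime records.

```python
import json
from collections import Counter


def hdfs_mkdir_command(path):
    """Return the shell command (as an argument list) that creates an HDFS directory."""
    return ["hdfs", "dfs", "-mkdir", "-p", path]


def count_by_category(json_text):
    """Count crime records per category from a JSON array of records."""
    records = json.loads(json_text)
    return Counter(record["category"] for record in records)


# Hypothetical target directory; use whatever path the NiFi flow writes to.
print(" ".join(hdfs_mkdir_command("/user/osbdet/uk_crime")))

# Made-up sample standing in for one file ingested by the flow.
sample = json.dumps([
    {"category": "burglary", "month": "2023-01"},
    {"category": "burglary", "month": "2023-01"},
    {"category": "robbery", "month": "2023-01"},
])
counts = count_by_category(sample)
print(counts.most_common())
```

The command list can be passed to `subprocess.run` on a machine where the `hdfs` client is installed, such as the OSBDET virtual machine.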

BAM! That's it! I recommend you check the PDF; it is an easy way to look at the analysis without having to replicate all the steps.

About

Data ingestion lab with NiFi, and data analysis of the UK Police Crime API with PySpark and Pandas
