Skip to content

Machine Learning project to predict Idaho wildfires.

Notifications You must be signed in to change notification settings

vc-adams/Idaho_Wildfires_Dashboard

 
 

Repository files navigation

Group 7, Project 3 – US Wildfires Database; Machine Learning Expansion Project

Members: Kaylon Young, Vincent Adams, Emeka Obianyor, Rob Pascarella

Abstract:

--Excerpt from Project 2--
Wildfires don’t stop where the wilderness ends. They burn through communities and neighborhoods, destroying property and taking lives. With climate-driven increases in wildfires in the U.S., it is imperative to understand how the risk to homes is also changing nationwide. The convergence of warmer, drier conditions and greater development into flammable landscapes is leaving many communities vulnerable to human-caused wildfires.

200911-wildfire-california-worst-widlfire-year-se-236p_f66f4d02e7ec04cbe3ed0c47dfc8037e

The objective of this project was to extract and explore a publicly available spatial database of wildfires that occurred in the United States from 1992 to 2015 and includes 1.88 million geo-referenced wildfire records, representing a total of 140 million acres burned during the 24-year period.
--

As explained above, from Project 2, large class wildfire data (Class G only - the largest fires which engulf 5000+ acres) was explored and visualized using a plethora of tools learned in the classroom. Project 3 aims to expand on that idea: We can combine it with hisotical weather data and draught data backed by the power of a machine learning model in an attempt to categorize a risk score for the upcoming wildfire season, and/or instance, for the user!

We are testing Multiple Linear Regression, Support Vecotor Machine and Neural Network methods to answuch questions with a machine learning project as:
• Can we predict the size, or in essence give a risk score - for a certain area and data given user-specified inputs?
• What areas of the state [note: to limit the dataset we are utlizing on Idaho data] are the most and least fire-prone?
• Seasonal impact on size and frequency?

Resources:

The Resourced that are used in this project include:

  1. 1.88 Million US Wildfires available from Kaggle
    • Data for the project will be refined given the number and volume of data exceeding 1,213,546 fires and burning 140,132,509 acres over a 24 year period.
    • This data set is massive. We've paired the project down to simply assess the state of Idaho.
  2. Weather data API: https://weatherstack.com/
  3. Weather data via Python Libary Meteostat: https://dev.meteostat.net/python/stations.html
  4. Drought data shapefiles: https://droughtmonitor.unl.edu/data/GISData.aspx

Subcomponents of assembling the project include:
• API call from weatherstack.com were initially utilized, until Meteostat ease of use prevailed.
• CityPy homework example was utilized to located nearest city for all data.
• Drought data shapefiles were merged.
• Weather data was combined with drought data and merged on dates and location.

Recreating Application

  1. Clone contents of the github repo.
  2. Using gitbash or terminal, navigate to the project directory in which you have copied, and run python app.py.
  3. Open a browswer window and enter localhost:5000 port. The applciation should run.

Application Deployed on Heroku

This project has been deployed on Heroku: https://wildfires-part2-app.herokuapp.com/

Example of Figures

  1. Tableau was utilized for aesthetic appeal and interactivity of data.
    image
  2. Leaflet map provides geograpical interactive component.
  3. User inputs allow tool to predict the most statisically viable fire size given the data for current location, etc.
    image