Disaster Response Pipeline Project

demo.mp4

Installation

run pip install -r requirements.txt to install all required libs.

The code should run with no issues using Python versions 3.*.

Project Description

This projects aims to build a model for an API that classifies disaster messages, so that they can be sent to the appropriate disaster relief agency.

When a disaster happens is when the agencies are flooded with messages and it's also when they have the least capacity to deal with it.

The goal is to speed up the process of recognizing important messages and redirecting them correctly.

This project is composed of the following steps:

Take real data from tweets and text messages sent during real life disaster events;
Prepare this data with an ETL Pipeline;
Build a Machine Learning Pipeline to classify new messages on future disaster events so that the messages can be sent to the appropriate disaster relief agency.

This project includes a web app where an emergency worker can input a new message and get classification results in several categories.

File Descriptions

Below are additional details about the project structure:

/app : contains the Flask webapp files.
/data : contains both .csv files used on the ETL pipeline as well as the process_data.py script that holds all the ETL pipeline and the .db result from the ETL pipeline.
/models : contains the train_classifier.py script that holds the ML pipeline as well as the model pickle file.
/notebooks contains Jupyter Notebooks that were used to build both pipeline scripts.

Instructions

Run the following commands in the project's root directory to set up your database and model.
- To run ETL pipeline that cleans data and stores in database
  
  python data/process_data.py data/disaster_messages.csv data/disaster_categories.csv data/disastermanegement.db
- To run ML pipeline that trains classifier and saves
  
  python models/train_classifier.py data/disastermanagement.db models/message_lr_classifier.pkl
Go to app directory: cd app
Run the web app: python run.py
Go to http://localhost:3003/

Licensing, Authors, and Acknowledgements

Licensing

MIT license

Authors

Marina Villaschi

Acknowledgements:

Appen (formally Figure 8) for providing the pre-labeled data.

Udacity and all staff involved for the great guidance and quality course material provided during the Data Science Nanodegree Program.

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
app		app
assets		assets
data		data
models		models
notebooks		notebooks
.gitignore		.gitignore
README.md		README.md
license.txt		license.txt
nltk.txt		nltk.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

app

app

assets

assets

data

data

models

models

notebooks

notebooks

.gitignore

.gitignore

README.md

README.md

license.txt

license.txt

nltk.txt

nltk.txt

requirements.txt

requirements.txt

Repository files navigation

Disaster Response Pipeline Project

Table of contents:

Installation

Project Description

File Descriptions

Instructions

Licensing, Authors, and Acknowledgements

Licensing

Authors

Acknowledgements:

About

Releases

Packages

Languages

License

marinavillaschi/disaster-response-pipeline

Folders and files

Latest commit

History

Repository files navigation

Disaster Response Pipeline Project

Table of contents:

Installation

Project Description

File Descriptions

Instructions

Licensing, Authors, and Acknowledgements

Licensing

Authors

Acknowledgements:

About

Topics

Resources

License

Stars

Watchers

Forks

Languages