Natural Language Processing with Disaster Tweets

Twitter has become one of the most popular sources of information. In particular, it has become an important real-time communication channel during emergencies. It is therefore useful to be able to analyze a tweet and determine whether it is reporting an emergency (disaster) or not.

Goal: Read tweets and classify each one into one of two classes: disaster or no disaster.

The data set contains more than 10,000 hand-classified tweets. Link to dataset: https://www.kaggle.com/c/nlp-getting-started/overview

For this project, I applied the main stages of a modeling workflow: exploratory data analysis, data cleaning, feature extraction, model fitting, and performance assessment.
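
As a rough illustration of the data cleaning and feature extraction stages, here is a minimal sketch using only the Python standard library. It is not the code from the notebook: the cleaning rules (dropping URLs, @mentions, and non-letter characters), the helper names `clean_tweet`, `build_vocab`, and `encode`, the vocabulary size, and the sequence length are all assumptions chosen for the example, and the two tweets are made up.

```python
import re
from collections import Counter

# Assumed cleaning rules; the notebook may use different ones.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
MENTION_RE = re.compile(r"@\w+")
NON_ALPHA_RE = re.compile(r"[^a-z\s]")


def clean_tweet(text):
    """Lowercase a tweet and strip URLs, @mentions, and non-letter characters."""
    text = text.lower()
    text = URL_RE.sub(" ", text)
    text = MENTION_RE.sub(" ", text)
    text = NON_ALPHA_RE.sub(" ", text)
    return " ".join(text.split())  # collapse repeated whitespace


def build_vocab(cleaned_tweets, max_words=10000):
    """Map the most frequent words to integer ids (0 = padding, 1 = unknown)."""
    counts = Counter(word for tweet in cleaned_tweets for word in tweet.split())
    return {word: i + 2 for i, (word, _) in enumerate(counts.most_common(max_words))}


def encode(tweet, vocab, max_len=8):
    """Turn a cleaned tweet into a fixed-length list of ids, zero-padded on the right."""
    ids = [vocab.get(word, 1) for word in tweet.split()][:max_len]
    return ids + [0] * (max_len - len(ids))


# Made-up example tweets, not taken from the data set.
tweets = [
    "Wildfire spreading near the highway, evacuate now http://t.co/abc123",
    "@friend this new album is fire!",
]
cleaned = [clean_tweet(t) for t in tweets]
vocab = build_vocab(cleaned)
encoded = [encode(t, vocab) for t in cleaned]
```

Fixed-length integer sequences like `encoded` are the kind of input that an embedding layer followed by a recurrent layer typically expects, which is why the sketch pads every tweet to the same length.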

Results and code are reported in the Jupyter notebook saved in this repository.

In short, I fit a recurrent neural network (LSTM) to the data, which yielded the best results: an AUC of 0.87.
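
To make the AUC figure easier to interpret, here is a small sketch of what the metric measures: the probability that a randomly chosen disaster tweet receives a higher predicted score than a randomly chosen non-disaster tweet. The function `roc_auc` and the labels and scores below are illustrative assumptions, not output from the notebook; in practice a library routine such as scikit-learn's `roc_auc_score` would normally be used instead of this quadratic-time loop.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a correct pair.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one example of each class")

    correct = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                correct += 1.0
            elif p == n:
                correct += 0.5
    return correct / (len(positives) * len(negatives))


# Made-up labels (1 = disaster) and predicted scores, for illustration only.
labels = [1, 0, 1, 0, 1, 0]
scores = [0.9, 0.2, 0.7, 0.8, 0.6, 0.1]
auc = roc_auc(labels, scores)  # 7 of the 9 positive/negative pairs are ordered correctly
```

Under this reading, a value of 0.5 corresponds to random ranking and 1.0 to a perfect separation of the two classes.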

Repository: alanakil/NLP_classifying_disaster_tweets