Skip to content

alanakil/NLP_classifying_disaster_tweets

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Natural Language Processing with Disaster Tweets

Twitter has become one of the most popular sources of information. In particular, it has become an important communication channel in emergencies that works in real time. Therefore it is of great interest to be able to analyze a tweet and determine if it is reporting an emergency (disaster) or not.

Goal: Read tweets and classify them into 2 classes: disaster or no disaster.

The data set contains more than 10,000 tweets that were hand classified. Link to dataset: https://www.kaggle.com/c/nlp-getting-started/overview

For this project, I applied important parts of a modeling scheme: exploratory data analysis, data cleaning, feature extraction, model fitting, and performance assessment.

Results and code are reported in the Jupyter notebook saved in this repository.

In short, we fit the data to a recurrent neural network (LSTM) which yielded the best results: auc of 0.87.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published