SpamClassifier

Model Deploy Using Flask on Heroku Platform

https://spam-email-classification.herokuapp.com/

In this project I build a model for classifying the SMS/Email into spam or ham through the text of the SMS/Email using standard classifiers.

What It Does:

Live Demo:

How It Does:

Extract the text and the target class from the dataset. Extract the features of the test using TF IDF vectorizer for the Input features.Split the skewed data into shuffled sets using stratified shuffle split in sklearn library. Use standard classifiers to classify the data into spam or ham.

Prerequisites:

I would highly recommend that before the hack night you have some kind of toolchain and development environment already installed and ready. If you have no idea where to start with this, try a combination like:

Python
scikit-learn / sklearn \
Pandas
nltk
NumPy
matplotlib
An environment to work in - something like Jupyter or Spyder For Linux people, your package manager should be able to handle all of this. If it somehow can't, see if you can at least install Python and pip and then use pip to install the abovepackages.

Dataset:

The SMS/Email Spam Collection is a set of SMS tagged messages that have been collected for SMS/Email Spam research. It contains one set of SMS messages in English of 5,567 messages, tagged according being ham (legitimate) or spam.

You can collect raw dataset from here .The files contain one message per line. Each line is composed by two columns:

Class- contains the label (ham or spam)
Message - contains the raw text.

ModelPipeline:

Components:

Using TF-IDF for feature extraction of the text data for the messages.
Use splits for skewed data(Since the number of ham are far more than the number of spam messages,the data is skewed)
Use stratified shuffled split for the split of skewed data.
Use different standard classifiers for classification of the SMS.
Compare the accuracy of various classifiers using standard classification metrics

AccuracyResult:

Future Scope:

Adding this feature in a dynamic website which supports contact-us typo feature.
Show live user inputs for Ham and Spam .

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
Screenshots		Screenshots
static		static
templates		templates
Email Spam Filter.ipynb		Email Spam Filter.ipynb
NB_spam_model.pkl		NB_spam_model.pkl
Procfile		Procfile
README.md		README.md
app.py		app.py
datasets_483_982_spam.csv		datasets_483_982_spam.csv
nltk.txt		nltk.txt
requirements.txt		requirements.txt

bharatc9530/Spam-Email-Classification

Folders and files

Latest commit

History

Repository files navigation

SpamClassifier

Model Deploy Using Flask on Heroku Platform

In this project I build a model for classifying the SMS/Email into spam or ham through the text of the SMS/Email using standard classifiers.

What It Does:

Live Demo:

How It Does:

Prerequisites:

Dataset:

ModelPipeline:

Components:

AccuracyResult:

Future Scope:

About

Topics

Resources

Stars

Watchers

Forks

Languages