Spam-Filter

Detect toxic/offensive messages using various classification techniques

The dataset used to train the models can be downloaded from https://www.kaggle.com/c/jigsaw-toxic-comment-classification-challenge/data

This repository contains two approaches to deal with toxic comments.

The first approach is to use three classification models, including logistic regression, gardient boosting trees, and multilayer perceptron models to train each type of toxic comments (toxic, severe toxic,obscene, threat, insult, and identity hate) seperately. The resutls after testing the models are summarized in the Accuracy.csv file.

The second approach is to use a LSTM model. WARNING: The visualizations contain offensive words

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
API		API
Visualization		Visualization
data		data
Classifier.py		Classifier.py
README.md		README.md
traditional_classifier.py		traditional_classifier.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

API

API

Visualization

Visualization

data

data

Classifier.py

Classifier.py

README.md

README.md

traditional_classifier.py

traditional_classifier.py

Repository files navigation

Spam-Filter

About

Releases

Packages

Languages

hiepnguyen034/Toxic-Comments-Classifer

Folders and files

Latest commit

History

Repository files navigation

Spam-Filter

About

Topics

Resources

Stars

Watchers

Forks

Languages