Toxic_Comment_Classification_using_Keras

Identify and classify toxic online comments

Discussing things you care about can be difficult. The threat of abuse and harassment online means that many people stop expressing themselves and give up on seeking different opinions. Platforms struggle to effectively facilitate conversations, leading many communities to limit or completely shut down user comments.

The Conversation AI team,

a research initiative founded by Jigsaw and Google (both a part of Alphabet) are working on tools to help improve online conversation. One area of focus is the study of negative online behaviors, like toxic comments (i.e. comments that are rude, disrespectful or otherwise likely to make someone leave a discussion). So far they’ve built a range of publicly available models served through the Perspective API, including toxicity. But the current models still make errors, and they don’t allow users to select which types of toxicity they’re interested in finding (e.g. some platforms may be fine with profanity, but not with other types of toxic content).

https://perspectiveapi.com/#/

Here we will build a multi-headed model that’s capable of detecting different types of of toxicity like threats, obscenity, insults, and identity-based hate better than Perspective’s current models. You’ll be using a dataset of comments from Wikipedia’s talk page edits. Improvements to the current model will hopefully help online discussion become more productive and respectful.

Current Work @Google : https://github.com/conversationai/unintended-ml-bias-analysis

DataSet : https://github.com/conversationai/unintended-ml-bias-analysis/tree/master/data

GloVe: Global Vectors for Word Representation : https://nlp.stanford.edu/projects/glove/

We will use these pre-trained embeddings when we need a way to quantify word co-occurrence (which also captures some aspects of word meaning.)

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.ipynb_checkpoints		.ipynb_checkpoints
Identify and classify toxic online comments.ipynb		Identify and classify toxic online comments.ipynb
LICENSE		LICENSE
README.md		README.md
Solve.md		Solve.md
model_BiDir_LSTM.h5		model_BiDir_LSTM.h5
new_model_BiDir_LSTM.weights.h5		new_model_BiDir_LSTM.weights.h5
test.csv		test.csv
train.csv		train.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.ipynb_checkpoints

.ipynb_checkpoints

Identify and classify toxic online comments.ipynb

Identify and classify toxic online comments.ipynb

LICENSE

LICENSE

README.md

README.md

Solve.md

Solve.md

model_BiDir_LSTM.h5

model_BiDir_LSTM.h5

new_model_BiDir_LSTM.weights.h5

new_model_BiDir_LSTM.weights.h5

test.csv

test.csv

train.csv

train.csv

Repository files navigation

Toxic_Comment_Classification_using_Keras

About

Releases

Packages

Languages

License

irfanalidv/Toxic_Comment_Classification_using_Keras

Folders and files

Latest commit

History

Repository files navigation

Toxic_Comment_Classification_using_Keras

About

Topics

Resources

License

Stars

Watchers

Forks

Languages