
Detection of Unintended Toxicity in the social comments

Description

The Conversation AI team, a research initiative founded by Jigsaw and Google (both part of Alphabet), builds technology to protect voices in conversation. A main area of focus is machine learning models that can identify toxicity in online conversations, where toxicity is defined as anything rude, disrespectful, or otherwise likely to make someone leave a discussion.

Here is the background: when the Conversation AI team first built toxicity models, they found that the models incorrectly learned to associate the names of frequently attacked identities with toxicity. The models predicted a high likelihood of toxicity for comments containing those identities (e.g. "gay"), even when those comments were not actually toxic (such as "I am a gay woman"). This happens because the training data was pulled from available sources where, unfortunately, certain identities are overwhelmingly referred to in offensive ways. Training a model on data with these imbalances risks simply mirroring those biases back to users.

More details about the problem can be found at: https://www.kaggle.com/c/jigsaw-unintended-bias-in-toxicity-classification/overview/timeline

Description of the files:

  1. "Part1_EDA_FE.ipynb"-- this file contains all Exploratory Data Analysis and Feature Engineering
  2. "part2_Applying_ML_algos.ipynb"--In this file some Machine Learing algorithms are appiled on preprocessed data
  3. "part_3_DL_algos.ipynb"-- In this file both Bidirectional LSTM/GRU are applied.

About

A code solution to the Kaggle competition using basic classification techniques.
