BITS (Bias Identification Test in Sentiments)

BITS is a sentence repository test that consists of 2,896 sentences curated to probe sentiment and toxicity analysis models for biases in sociodemographic factors like disability, race and gender.

The dataset is currently divided into three primary facets. They are:

Disability Facet
Race Facet
Gender Facet

As the name explains, each facet contains sentences that are created to check if language models differentiate words based on that particular sociodemographic group it represents. This means that the Disability Facet can be used to check if language models are biased towards people with disability and the Race facet can be used to check if language models are biased towards certain races.

The motivation behind the creation of BITS is to intitiate the first step to eliminate un-intended bias in sentiment analysis, i.e. identification. A model, if used in a social environment, must be aware of the possible biases it might have. The BITS template is created in a way that it can be easily modified or updated to include groups of concern for the usecase it is used in. More details about the creation process can be found in . Do cite the work in case you use this dataset to check for bias in sentiment or toxicity analysis models.

Research Paper: @article{venkit2021identification, title={Identification of bias against people with disabilities in sentiment analysis and toxicity detection models}, author={Venkit, Pranav Narayanan and Wilson, Shomir}, journal={arXiv preprint arXiv:2111.13259}, year={2021} }

Please feel free to email us at pranav.venkit@psu.edu or shomir@psu.edu for additional information regarding this work.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
Disability Facet_NonSocial.xlsx		Disability Facet_NonSocial.xlsx
Disability Facet_Social.csv		Disability Facet_Social.csv
Disability_Facet.xlsx		Disability_Facet.xlsx
Disablility_Facet_Results.csv		Disablility_Facet_Results.csv
Gender Facet.xlsx		Gender Facet.xlsx
README.md		README.md
Race Facet.xlsx		Race Facet.xlsx
Standard_Corpus.csv		Standard_Corpus.csv
Template_disability.csv		Template_disability.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Disability Facet_NonSocial.xlsx

Disability Facet_NonSocial.xlsx

Disability Facet_Social.csv

Disability Facet_Social.csv

Disability_Facet.xlsx

Disability_Facet.xlsx

Disablility_Facet_Results.csv

Disablility_Facet_Results.csv

Gender Facet.xlsx

Gender Facet.xlsx

README.md

README.md

Race Facet.xlsx

Race Facet.xlsx

Standard_Corpus.csv

Standard_Corpus.csv

Template_disability.csv

Template_disability.csv

Repository files navigation

BITS (Bias Identification Test in Sentiments)

About

Releases

Packages

PranavNV/BITS

Folders and files

Latest commit

History

Repository files navigation

BITS (Bias Identification Test in Sentiments)

About

Resources

Stars

Watchers

Forks