Bias in NLP

This is a collection of natural language processing papers that deal with bias (mostly gender bias). The list is by no means complete; it is just a way to keep up with the large number of papers in this area. If a paper is missing, please add it.

Papers

Towards Detection of Subjective Bias using Contextualized Word Embeddings
WebConf2020 - Paper, Code
Note: Wiki Neutrality Corpus (WNC).

Joint Multiclass Debiasing of Word Embeddings
ISMIS2020 - Paper, Code
Note: Hard and Soft WEAT

Towards Debiasing Sentence Representations
ACL2020 - Paper, Code
Note: Sentence-level debiasing. Difference between pretraining and finetuning.

Neutralizing Gender Bias in Word Embedding with Latent Disentanglement and Counterfactual Generation
arxiv2020 - Paper
Note: Counterfactual generation.

Unsupervised Discovery of Implicit Gender Bias
arxiv2020 - Paper, Code
Note: Unsupervised bias detection from comments.

StereoSet: Measuring stereotypical bias in pretrained language models
arxiv2020 - Paper, Code
Note: Benchmark and Dataset for measuring bias in 4 domains (gender, profession, race, religion).

Double-Hard Debias: Tailoring Word Embeddings for Gender Bias Mitigation
ACL2020 - Paper, Code
Note: Double-Hard Debias: first mitigate corpus regularities (such as word frequency) in the embeddings, then apply hard debiasing.

Gender Bias in Multilingual Embeddings and Cross-Lingual Transfer
ACL2020 - Paper
Note: Bias in multilingual embeddings depends on the alignment direction.

Scalable Cross Lingual Pivots to Model Pronoun Gender for Translation
arxiv2020 - Paper
Note: Gender labels for pronouns in MT English-Spanish.

Detecting Emergent Intersectional Biases: Contextualized Word Embeddings Contain a Distribution of Human-like Biases
arxiv2020 - Paper
Note: CEAT

OSCaR: Orthogonal Subspace Correction and Rectification of Biases in Word Embeddings
arxiv2020 - Paper
Note: Preserve semantic meaning of embeddings.

Investigating Gender Bias in BERT
arxiv2020 - Paper
Note: Identify one gender direction per BERT layer.

Type B Reflexivization as an Unambiguous Testbed for Multilingual Multi-Task Gender Bias
arxiv2020 - Paper, Code
Note: Multilingual multitask dataset across 4 languages.

Towards Debiasing NLU Models from Unknown Biases
arxiv2020 - Paper, Code
Note: Unsupervised bias detection.

Robustness and Reliability of Gender Bias Assessment in Word Embeddings: The Role of Base Pairs
arxiv2020 - Paper, Code
Note: Choice of base pairs is relevant.

LOGAN: Local Group Bias Detection by Clustering
arxiv2020 - Paper
Note: Identify biases through clustering.

Exploring the Linear Subspace Hypothesis in Gender Bias Mitigation
arxiv2020 - Paper
Note: Verify whether non-linear debiasing helps. It seems not.

Unmasking Contextual Stereotypes: Measuring and Mitigating BERT’s Gender Bias
GeBNLP2020 - Paper, Code
Note: Verify gender debiasing techniques in German.

Language (Technology) is Power: A Critical Survey of “Bias” in NLP
arxiv2020 - Paper
Note: Metastudy: survey of 146 papers analyzing "bias" in NLP systems.

Pick a Fight or Bite your Tongue: Investigation of Gender Differences in Idiomatic Language Usage
arxiv2020 - Paper
Note: Idiomatic expressions depending on the speaker.

Evaluating Bias In Dutch Word Embeddings
GeBNLP2020 - Paper, Code
Note: Examining bias in Dutch (using WEAT)

Analyzing Gender Bias within Narrative Tropes
arxiv2020 - Paper, Code
Note: Analyze bias using tropes

Neural Machine Translation Doesn’t Translate Gender Coreference Right Unless You Make It
GeBNLP2020 - Paper, Code
Note: Incorporate explicit word-level gender tags.

The Gap on GAP: Tackling the Problem of Differing Data Distributions in Bias-Measuring Datasets
NeurIPS 2020 - Paper, Code
Note: Distances in GAP play a role.

AraWEAT: Multidimensional Analysis of Biases in Arabic Word Embeddings
arxiv2020 - Paper, Code
Note: Arabic WEAT.

Characterising Bias in Compressed Models
arxiv2020 - Paper
Note: Bias in compressed models is large. Provides a method to identify biased examples.

Lipstick on a Pig: Debiasing Methods Cover up Systematic Gender Biases in Word Embeddings But do not Remove Them
NAACL2019 - Paper, Code
Note: Debiasing by setting dimensions to zero is not effective.

Equalizing Gender Bias in Neural Machine Translation with Word Embeddings Techniques
GeBNLP 2019 - Paper
Note: Spanish-English translation with occupations.

Evaluating the Underlying Gender Bias in Contextualized Word Embeddings
GeBNLP 2019 - Paper
Note: Contextualized embeddings are less biased than static ones.

Mitigating Gender Bias in Natural Language Processing: Literature Review
ACL2019 - Paper
Note: Survey

What's in a Name? Reducing Bias in Bios without Access to Protected Attributes
NAACL2019 - Paper
Note: Work on biographies.

Assessing Social and Intersectional Biases in Contextualized Word Representations
NeurIPS2019 - Paper
Note: Strong bias in contextualized embeddings. Bias not always visible on sentence level.

It’s All in the Name: Mitigating Gender Bias with Name-Based Counterfactual Data Substitution
EMNLP2019 - Paper
Note: Counterfactual Data Substitution (CDS)

Good Secretaries, Bad Truck Drivers? Occupational Gender Stereotypes in Sentiment Analysis
GeBNLP 2019 - Paper, Code
Note: Dataset of 800 sentences analysed with sentiment analysis.

Automatic Gender Identification and Reinflection in Arabic
GeBNLP 2019 - Paper
Note: Arabic-English translation with a focus on getting the pronouns right.

Gendered Ambiguous Pronouns Shared Task: Boosting Model Confidence by Evidence Pooling
GeBNLP 2019 - Paper, Code
Note: Shared task winner GAP

Gendered Ambiguous Pronouns (GAP) Shared Task at the Gender Bias in NLP Workshop 2019
GeBNLP 2019 - Paper, Code
Note: GAP shared task description

Conceptor Debiasing of Word Representations Evaluated on WEAT
GeBNLP 2019 - Paper
Note: Proposes Conceptor Debiasing.

On Measuring Gender Bias in Translation of Gender-neutral Pronouns
GeBNLP 2019 - Paper, Code
Note: Gender bias in pronoun translation Korean-English

Measuring Gender Bias in Word Embeddings across Domains and Discovering New Gender Bias Word Categories
GeBNLP 2019 - Paper, Code
Note: Clustering method for discovering new biases.

The Role of Protected Class Word Lists in Bias Identification of Contextualized Word Representations
GeBNLP 2019 - Paper
Note: Uses conceptor debiasing

The Woman Worked as a Babysitter: On Biases in Language Generation
EMNLP2019 - Paper, Code
Note: Regard and Sentiment. Annotations released.

Exploring Human Gender Stereotypes with Word Association Test
EMNLP2019 - Paper, Code
Note: Word association graphs

Gender-preserving Debiasing for Pre-trained Word Embeddings
ACL2019 - Paper, Code
Note: Differentiate between bias and gender information.

Quantifying Social Biases in Contextual Word Representations
GeBNLP 2019 - Paper
Note: Template-based method to quantify bias.

Bias in Bios: A Case Study of Semantic Representation Bias in a High-Stakes Setting
ACM FAT* 2019 - Paper
Note: Analyze effects of bias.

Gender Bias in Neural Natural Language Processing
Logic, Language, and Security. Springer. 2018 - Paper
Note: Counterfactual Data Augmentation (CDA). Clear definition of Bias. Evaluates on coreference resolution and language modelling.

Gender Bias in Coreference Resolution
NAACL2018 - Paper, Code
Note: Winogender schemas.

Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings
NeurIPS2016 - Paper, Code
Note: Among the first to address gender bias

Rejecting the Gender Binary: A Vector-Space Operation.
2015 - Paper
Note: Blog post: first to propose to remove gender dimension

TODOS

add https://arxiv.org/pdf/2011.12086.pdf
add https://arxiv.org/pdf/2011.12096.pdf
add papers from GeBNLP2020 once they are available.
