Readme

UPDATE -- 14.2.23

We added streamlined evaluation scripts for the replication of our word sense disambiguation experiments detailed in Sections 2 and 3 of the paper. These can be found in the benchmark_evaluation_scripts directory.

This repository contains the experimental code for the publication Detecting Word Sense Disambiguation Biases in Machine Translation for Model-Agnostic Adversarial Attacks (Emelin, Denis, Ivan Titov, and Rico Sennrich, EMNLP 2020).

The readme is meant to provide an overview of the functionality of the different scripts included in this codebase and their relation to the paper’s contents. For the required and optional arguments of each script, please run python3 script_name.py -h. References to relevant paper sections (:blue_book:) are given in bold.

Requirements

python 3.x
nltk
numpy
spacy
langid
pandas
sklearn
torch
pingouin
babelnetpy
transformers
tensorflow_hub
language_tool_python
sentence_transformers

Resource collection

resource_collection/clean_corpora.py
Cleans the raw parallel corpora.
📘 See Appendix A1 for details.

resource_collection/scrape_babelnet.py
Collects sense clusters for English homographs from BabelNet and refines them by applying filtering heuristics.
📘 See Section 2.1, Resource collection for details.

resource_collection/remove_sense_duplicates.py
Removes sense duplicates from collected BabelNet sense clusters.

resource_collection/extract_attractor_terms.py
Extracts attractor terms from specified training corpora, assigning them to corresponding homograph senses clusters, and computes their disambiguation bias values.
📘 See Section 2.1 for details.

resource_collection/extract_seed_pairs.py
Extracts seed sentences containing homographs from held-out and test corpora for the benchmarking of WSD error prediction performance and the generation of adversarial samples.
📘 See Section 2.2 for details.

resource_collection/extract_homograph_modifiers.py
Extracts adjectives observed to modify known homograph senses in the English portion of the training corpora, used to constrain the generation of adversarial samples.
📘 See Section 3.1, Attractor selection for details.

resource_collection/extract_non_homograph_modifiers.py
Extracts adjectives observed to modify non-homograph nouns in a specified monolingual English corpus, used to constrain the generation of adversarial samples.
📘 See Section 3.1, Attractor selection for details.

Adversarial sample generation

adversarial_sample_generation/generate_adversarial_samples.py
Generates adversarial samples by applying the proposed perturbations to seed sentences and running various filtering heuristics to ensure sample quality.
📘 See Section 3.1 for details.

adversarial_sample_generation/score_seeds_with_bert.py
Identifies and removes seed sentences containing ambiguous homograph mentions.
📘 See Section 3.1, Seed sentence selection for details.

adversarial_sample_generation/score_samples_with_lm.py
Estimates sentence perplexity increases in adversarial samples relative to their respective seed sentences.
📘 See Section 3.1, Post-generation filtering for details.

Evaluation

evaluation/evaluate_attack_success.py
Checks whether unperturbed sentences are translated correctly (see Section 2.2 for details) and whether adversarial attacks are successful.
📘 See Sections 2.2 and 3.2 for details.

evaluation/check_challenge_overlap.py
Computes the overlap between WSD error prediction challenge sets.
📘 See Section 2.2, Challenge set evaluation for details.

evaluation/check_sample_transferability.py
Computes the Jaccard similarity index between several sets of successful adversarial samples.
📘 See Section 4 for details.

evaluation/create_human_annotation_forms.py
Generates forms used in the human evaluation of sample ambiguity and naturalness.
📘 See Section 3.3 for details.

evaluation/evaluate_human_annotation.py
Evaluates the judgments collected from human annotators and computes inter-annotator agreement scores.
📘 See Section 3.3 for details.

evaluation/evaluate_perturbation_efficacy.py
Estimate the correlations between successes of adversarial attacks and the perturbation types used to generate them.
See Section 3.2 for details.

evaluation/check_grammaticality_preservation.py
Detects grammar errors in seed sentences and adversarial samples for measuring grammaticality degradation after adversarial perturbation.
📘 See Section 3.3 for details.

evaluation/generate_adversarial_challenge_set.py
Creates the adversarial challenge set.
📘 See Section 3.2, Challenge set evaluation for details.

evaluation/generate_wsd_challenge_set.py
Creates the WSD error prediction challenge set based on sentence-level disambiguation bias scores.
📘 See Section 2.2, Challenge set evaluation for details.

evaluation/generate_wsd_challenge_set_from_homographs.py
Create the WSD error prediction challenge set based on homograph sense cluster frequency.
📘 See Section 2.2, Challenge set evaluation for details.

evaluation/test_attractor_correlations.py
Calculates correlation scores based on attractor-specific disambiguation bias values.
📘 See Section 3.2 for details.

evaluation/test_seed_sentence_correlations.py
Calculates correlation scores based on sentence-level disambiguation bias values.
📘 See Section 2.2 for details.

evaluation/test_homograph_correlations.py
Calculates correlation scores based on homograph sense cluster frequency.
📘 See Section 2.2 for details.

evaluation/write_adversarial_tables_to_text.py
Writes adversarial samples to a plain text file to be used as input to baseline NMT models.

Resources

./ende_homograph_sense_clusters.json
Contains the manually refined homograph sense cluster lexicon used in all experiments.
📘 See Section 2.1, Resource collection for details.

Citation

@inproceedings{Emelin2020DetectingWS,
  title={Detecting Word Sense Disambiguation Biases in Machine Translation for Model-Agnostic Adversarial Attacks},
  author={Denis Emelin and Ivan Titov and Rico Sennrich},
  booktitle={EMNLP},
  year={2020}
}

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
adversarial_sample_generation		adversarial_sample_generation
benchmark_evaluation_scripts		benchmark_evaluation_scripts
evaluation		evaluation
resource_collection		resource_collection
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
ende_homograph_sense_clusters.json		ende_homograph_sense_clusters.json
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

adversarial_sample_generation

adversarial_sample_generation

benchmark_evaluation_scripts

benchmark_evaluation_scripts

evaluation

evaluation

resource_collection

resource_collection

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

ende_homograph_sense_clusters.json

ende_homograph_sense_clusters.json

requirements.txt

requirements.txt

Repository files navigation

Readme

UPDATE -- 14.2.23

Requirements

Resource collection

Adversarial sample generation

Evaluation

Resources

Citation

About

Releases

Packages

Languages

License

demelin/detecting_wsd_biases_for_nmt

Folders and files

Latest commit

History

Repository files navigation

Readme

UPDATE -- 14.2.23

Requirements

Resource collection

Adversarial sample generation

Evaluation

Resources

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Languages