GitHub - nesl/nlp_adversarial_examples: Implementation code for the paper "Generating Natural Language Adversarial Examples"

Download the Imdb dataset

./download_dataset.sh

Download the glove vector embeddings (used by the model)

 ./download_glove.sh

Download the counter-fitted vectors (used by our attack)

./download_counterfitted_vectors.sh

Build the vocabulary and embeddings matrix.

python build_embeddings.py

That will take like a minute, and it will tokenize the dataset and save it to a pickle file. It will also compute some auxiliary files like the matrix of the vector embeddings for words in our dictionary. All files will be saved under aux_files directory created by this script.

Train the sentiment analysis model.

python train_model.py

6)Download the Google language model.

./download_googlm.sh

Pre-compute the distances between embeddings of different words (required to do the attack) and save the distance matrix.

python compute_dist_mat.py

Now, we are ready to try some attacks ! You can do so by running the IMDB_AttackDemo.ipynb Jupyter notebook !

Attacking Textual Entailment model

The model we are using for our experiment is the SNLI model of Keras SNLI Model .

First, Download the dataset using

bash download_snli_data.sh

Download the Glove and Counter-fitted Glove embedding vectors

bash ./download_glove.sh
bash ./download_counterfitted_vectors.sh

Train the NLI model

python sni_rnn.py

Pre-compute the embedding matrix

python nli_compute_dist_matrix.py

Now, you are ready to run the attack using example code provided in NLI_AttackDemo.ipynb Jupyter notebook.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
BaselineDemo.ipynb		BaselineDemo.ipynb
IMDB_AttackDemo.ipynb		IMDB_AttackDemo.ipynb
LICENSE		LICENSE
NLI_AttackDemo.ipynb		NLI_AttackDemo.ipynb
README.md		README.md
VisualizeResults.ipynb		VisualizeResults.ipynb
attacks.py		attacks.py
build_embeddings.py		build_embeddings.py
compute_dist_mat.py		compute_dist_mat.py
data_utils.py		data_utils.py
display_utils.py		display_utils.py
download_counterfitted_vectors.sh		download_counterfitted_vectors.sh
download_dataset.sh		download_dataset.sh
download_glove.sh		download_glove.sh
download_googlm.sh		download_googlm.sh
download_snli_data.sh		download_snli_data.sh
glove_utils.py		glove_utils.py
goog_lm.py		goog_lm.py
lm_data_utils.py		lm_data_utils.py
lm_utils.py		lm_utils.py
models.py		models.py
nli_compute_dist_matrix.py		nli_compute_dist_matrix.py
snli_rnn.py		snli_rnn.py
train_model.py		train_model.py

License

nesl/nlp_adversarial_examples

Folders and files

Latest commit

History

Repository files navigation

Attacking Textual Entailment model

About

Resources

License

Stars

Watchers

Forks

Languages