strubell/LISA-v1

LISA: Linguistically-Informed Self-Attention

This is the original implementation of the linguistically-informed self-attention (LISA) model described in the following paper:

Emma Strubell, Patrick Verga, Daniel Andor, David Weiss, and Andrew McCallum. Linguistically-Informed Self-Attention for Semantic Role Labeling. Conference on Empirical Methods in Natural Language Processing (EMNLP). Brussels, Belgium. October 2018.

This code is based on a fork of Timothy Dozat's open source graph-based dependency parser, and the code to run ELMo is copied from AI2's TensorFlow implementation. Thanks Tim and AI2!

This code is released for exact replication of the paper. You can find a work-in-progress but vastly improved re-implementation of LISA here.

Requirements:

  • Python 2.7
  • TensorFlow >= 1.1
  • h5py (for ELMo models); an example install command for these requirements is sketched below
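A minimal install sketch (assuming pip in a Python 2.7 environment; the exact TensorFlow version pin below is an assumption, the repository only requires >= 1.1):

# CPU build shown; substitute tensorflow-gpu if you have a CUDA-capable GPU
pip install "tensorflow>=1.1,<2.0" h5py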

Quick start:

Data setup (CoNLL-2005, GloVe):

  1. Get pre-trained word embeddings (GloVe):
    wget -P data http://nlp.stanford.edu/data/glove.6B.zip
    unzip -j data/glove.6B.zip glove.6B.100d.txt -d data/glove
  2. Get CoNLL-2005 data in the right format using this repo. Follow the instructions all the way through preprocessing for evaluation.
  3. In any config files you wish to use below, make sure data_dir is set to the root directory of the data (see the sketch after this list).
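The config file layout is not documented here, so the following is only a sketch; it assumes the files under config/ use simple data_dir = ... lines and that config/lisa-conll05.conf is the config you plan to use:

# show where data_dir currently points
grep -n 'data_dir' config/lisa-conll05.conf
# set it to your data root (writes a .orig backup; adjust the pattern if the format differs)
sed -i.orig 's|^\(data_dir *= *\).*|\1/path/to/conll05-data|' config/lisa-conll05.conf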

Download and evaluate a pre-trained model:

Pre-trained models are available for download via Google Drive here.

To download the model from Google Drive on the command line, you can use the following bash function (from here):

function gdrive_download () {
  CONFIRM=$(wget --quiet --save-cookies /tmp/cookies.txt --keep-session-cookies --no-check-certificate "https://docs.google.com/uc?export=download&id=$1" -O- | sed -rn 's/.*confirm=([0-9A-Za-z_]+).*/\1\n/p')
  wget --load-cookies /tmp/cookies.txt "https://docs.google.com/uc?export=download&confirm=$CONFIRM&id=$1" -O "$2"
  rm -f /tmp/cookies.txt
}

The gdrive_download function takes two parameters: a Google Drive ID and an output filename. You can find model names and their corresponding IDs in this table:

Model                     ID
sa-conll05                1OygfdFs5Q8Fn1MN8oLnwVhIqCgCZBx11
lisa-conll05              1XyfyjjjQVJK16XhqHoY9GK8GDOqZX2qk
sa-conll12                1cpveFb380WZoTZ_VpJSRqUzupGODX40A
lisa-conll12              1V9-lB-TrOxvProqiJ5Tx1cFUdHkTyjbw
sa-conll05-gold-preds     1qMUXUZwqzygHqYJq_8jqm_maniHg3POK
lisa-conll05-gold-preds   1rwazpyIqIOIfiNqca1Tk5MEA5qM6Rhf1
sa-conll05-elmo           1RX-1YlHQPLRiJGzKF_KHjbNiVCyICRWZ
lisa-conll05-elmo         1J5wrZUQ7UIVpbZjd7RtbO3EcYE5aRBJ7
sa-conll12-elmo           1KBaPr7jwXYuOlBDBePrSiohG-85mv8qJ
lisa-conll12-elmo         1Qvh4WHX7u_UaLt-QAZcCGbo0uOlvG-EN

To download and evaluate e.g. the lisa-conll05 model using GloVe embeddings:

mkdir -p models
gdrive_download 1XyfyjjjQVJK16XhqHoY9GK8GDOqZX2qk models/lisa-conll05.tar.gz
tar xzvf models/lisa-conll05.tar.gz -C models
python network.py --load --test --test_eval --config_file models/lisa-conll05/config.cfg 

To evaluate it on the Brown test set:

python network.py --load --test --test_eval --load_dir models/lisa-conll05 --config_file models/lisa-conll05/config.cfg --test_file $CONLL05/test.brown.gz.parse.sdeps.combined.bio --gold_test_props_file $CONLL05/conll2005-test-brown-gold-props.txt --gold_test_parse_file $CONLL05/conll2005-test-brown-gold-parse.txt

Evaluating with ELMo embeddings:

First, download the pre-trained ELMo model and options into a directory called elmo_model:

wget -P elmo_model https://s3-us-west-2.amazonaws.com/allennlp/models/elmo/2x4096_512_2048cnn_2xhighway/elmo_2x4096_512_2048cnn_2xhighway_weights.hdf5
wget -P elmo_model https://s3-us-west-2.amazonaws.com/allennlp/models/elmo/2x4096_512_2048cnn_2xhighway/elmo_2x4096_512_2048cnn_2xhighway_options.json

In order for the model to fit in memory, you may want to reduce max_characters_per_token to 25 in the options file (the -i.orig flag below edits the file in place and keeps a backup):

sed -i.orig 's/"max_characters_per_token": [0-9][0-9]*/"max_characters_per_token": 25/' elmo_model/elmo_2x4096_512_2048cnn_2xhighway_options.json
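An ELMo model can then be downloaded and evaluated the same way as the GloVe models above; for example, for lisa-conll05-elmo (this assumes the archive unpacks to models/lisa-conll05-elmo and that its config already points at the elmo_model directory):

mkdir -p models
gdrive_download 1J5wrZUQ7UIVpbZjd7RtbO3EcYE5aRBJ7 models/lisa-conll05-elmo.tar.gz
tar xzvf models/lisa-conll05-elmo.tar.gz -C models
python network.py --load --test --test_eval --config_file models/lisa-conll05-elmo/config.cfg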

Run the D&M+ELMo parser

Code for running the D&M+ELMo parser that we use in our paper can be found in the elmo-parser-stable branch of this repo:

git checkout elmo-parser-stable

If you haven't already, download the pre-trained ELMo model.

Pre-trained parser models are available via Google Drive. Here are the corresponding ids if you wish to download via command line:

Model             ID
dm-conll05-elmo   1uVJna6ddiJCWelU384ssJrnBJGWlPUML
dm-conll12-elmo   1dxW4D8xu-WHg-mbe5oGx8DUx22WY8228

To download and evaluate e.g. the dm-conll05-elmo parser, you can do the following:

git checkout elmo-parser-stable
mkdir -p models
gdrive_download 1uVJna6ddiJCWelU384ssJrnBJGWlPUML models/dm-conll05-elmo.tar.gz
tar xzvf models/dm-conll05-elmo.tar.gz -C models
python network.py --load --test --test_eval --config_file models/dm-conll05-elmo/config.cfg 

Evaluate LISA using the D&M parse

Once you've run the D&M+ELMo parser, you may want to provide that parse to LISA. You can do this in two steps: (1) create a new test file that contains the D&M predicted parses rather than the gold parses, then (2) provide this new test file to LISA along with a command line flag that instructs LISA to use the parse information in the file rather than its own predicted parses (the default behavior).

To create a new test file, first run the parser on the test file for which you would like to predict parses. If you ran the parser under models/dm-conll05-elmo, there should now be a newly generated file: models/dm-conll05-elmo/parse_preds.tsv. Below we assume the original test file resides under the directory defined in the environment variable $CONLL05. To create a new test file with the parse heads and labels replaced by the predicted ones, you can use the following script:

./bin/replace_gold_parse.sh $CONLL05/test.wsj.gz.parse.sdeps.combined.bio models/dm-conll05-elmo/parse_preds.tsv > test.wsj.gz.parse.sdeps.combined.bio.predicted

Now you can run LISA evaluation using this test file instead of the original one, telling LISA to use the parses given in the data file (which are now the D&M predictions) rather than its own predicted parses:

python network.py --load --test --test_eval --config_file models/lisa-conll05/config.cfg --test_file test.wsj.gz.parse.sdeps.combined.bio.predicted --gold_attn_at_train False 
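The same two steps should carry over to the Brown test set; the following sketch reuses the Brown-specific flags from the earlier evaluation command (the .predicted filename is just illustrative):

python network.py --load --test --test_eval --config_file models/dm-conll05-elmo/config.cfg --test_file $CONLL05/test.brown.gz.parse.sdeps.combined.bio
./bin/replace_gold_parse.sh $CONLL05/test.brown.gz.parse.sdeps.combined.bio models/dm-conll05-elmo/parse_preds.tsv > test.brown.gz.parse.sdeps.combined.bio.predicted
python network.py --load --test --test_eval --config_file models/lisa-conll05/config.cfg --test_file test.brown.gz.parse.sdeps.combined.bio.predicted --gold_attn_at_train False --gold_test_props_file $CONLL05/conll2005-test-brown-gold-props.txt --gold_test_parse_file $CONLL05/conll2005-test-brown-gold-parse.txt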

Note that when you change command-line parameters (as above), config.cfg will be rewritten with the latest configuration settings. You can avoid this by specifying a --save_dir that differs from --load_dir; for simplicity, they are the same in all of the examples above.
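For example, to evaluate without touching the downloaded model directory (the new save directory name is arbitrary):

python network.py --load --test --test_eval --load_dir models/lisa-conll05 --save_dir models/lisa-conll05-eval --config_file models/lisa-conll05/config.cfg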

Evaluate LISA using the gold parse

Similarly, you can easily evaluate LISA using the original, gold parses. To do so on the dev set only, omit the --test_eval flag:

python network.py --load --test --gold_attn_at_train False --config_file models/lisa-conll05/config.cfg

Train a model:

We highly recommend using a GPU.

To train a model with save directory model, using the configuration file config/lisa-conll05.conf:

python network.py --config_file config/lisa-conll05.conf --save_dir model
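Once training has finished, evaluation presumably follows the same pattern as for the released models above, using the config.cfg written into the save directory:

python network.py --load --test --test_eval --config_file model/config.cfg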

Results

CoNLL-2005 results for released models w/ predicted predicates (dev, WSJ test, Brown test):

                          Dev                    WSJ test               Brown test
Model                     P      R      F1       P      R      F1       P      R      F1
sa-conll05                83.52  81.28  82.39    84.17  83.28  83.72    72.98  70.10  71.51
lisa-conll05              83.10  81.39  82.24    84.07  83.16  83.61    73.32  70.56  71.91
lisa-conll05 +D&M         84.44  82.89  83.66    85.98  84.85  85.41    75.93  73.45  74.67
lisa-conll05 +Gold        87.91  85.73  86.81    ---    ---    ---      ---    ---    ---
sa-conll05-elmo           85.78  84.74  85.26    86.21  85.98  86.09    77.10  75.61  76.35
lisa-conll05-elmo         86.07  84.64  85.35    86.69  86.42  86.55    78.95  77.17  78.05
lisa-conll05-elmo +D&M    85.83  84.51  85.17    87.13  86.67  86.90    79.02  77.49  78.25
lisa-conll05-elmo +Gold   88.51  86.77  87.63    ---    ---    ---      ---    ---    ---

CoNLL-2005 results for released models w/ gold predicates (dev, WSJ test, Brown test):

                                Dev                    WSJ test               Brown test
Model                           P      R      F1       P      R      F1       P      R      F1
sa-conll05-gold-preds           83.12  82.81  82.97    84.80  84.13  84.46    74.83  72.81  73.81
lisa-conll05-gold-preds         83.60  83.74  83.67    84.72  84.57  84.64    74.77  74.32  74.55
lisa-conll05-gold-preds +D&M    85.38  85.89  85.63    86.58  86.60  86.59    77.43  77.08  77.26
lisa-conll05-gold-preds +Gold   89.11  89.38  89.25    ---    ---    ---      ---    ---    ---

CoNLL-2012 results for released models (dev, test):

                          Dev                    Test
Model                     P      R      F1       P      R      F1
sa-conll12                82.32  79.76  81.02    82.55  80.02  81.26
lisa-conll12              81.77  79.69  80.72    81.79  79.45  80.60
lisa-conll12 +D&M         83.58  81.56  82.55    83.71  81.61  82.65
lisa-conll12 +Gold        87.73  85.31  86.51    ---    ---    ---
sa-conll12-elmo           84.09  82.40  83.24    84.28  82.21  83.23
lisa-conll12-elmo         84.34  82.73  83.53    84.27  82.47  83.36
lisa-conll12-elmo +D&M    84.30  82.98  83.64    84.41  82.87  83.63
lisa-conll12-elmo +Gold   88.12  86.40  87.25    ---    ---    ---

CoNLL-2005 dependency parsing results for released models (dev (section 24), WSJ test, Brown test):

                    Dev (section 24)    WSJ test        Brown test
Model               UAS    LAS          UAS    LAS      UAS    LAS
dm-conll05-elmo     95.25  92.54        96.47  93.94    93.53  89.62

CoNLL-2012 dependency parsing results for released models (dev, test):

                    Dev             Test
Model               UAS    LAS      UAS    LAS
dm-conll12-elmo     95.30  92.51    95.30  93.05
