Attention on SNLI

Unofficial implementations of attention-based models on the SNLI dataset.

Currently implemented papers:

  1. "Reasoning About Entailment With Neural Attention", arXiv:1509.06664
  2. "Learning Natural Language Inference with LSTM", arXiv:1512.08849

Based on Lasagne.
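
For orientation, below is a minimal NumPy sketch of the word-by-word attention step from the first paper (arXiv:1509.06664). It is not the repository's Lasagne code: the weight matrices are random stand-ins for learned parameters, and the dimension names are assumptions.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def word_by_word_attention(Y, H, k, seed=0):
    """Word-by-word attention over the premise, one step per hypothesis word.

    Y: (k, L) premise LSTM outputs; H: (k, N) hypothesis LSTM outputs.
    Returns r_N, the final attention-weighted premise representation.
    """
    L, N = Y.shape[1], H.shape[1]
    rng = np.random.RandomState(seed)
    W_y, W_h, W_r, W_t = [0.1 * rng.randn(k, k) for _ in range(4)]
    w = 0.1 * rng.randn(k)
    r = np.zeros(k)
    for t in range(N):
        h_t = H[:, t]
        # M_t = tanh(W_y Y + (W_h h_t + W_r r_{t-1}) 1_L^T)
        M = np.tanh(W_y @ Y + np.outer(W_h @ h_t + W_r @ r, np.ones(L)))
        # alpha_t = softmax(w^T M_t): attention weights over premise words
        alpha = softmax(w @ M)
        # r_t = Y alpha_t^T + tanh(W_t r_{t-1})
        r = Y @ alpha + np.tanh(W_t @ r)
    return r

# Toy usage: k=4 hidden units, premise length 5, hypothesis length 3
k = 4
r_final = word_by_word_attention(np.random.randn(k, 5), np.random.randn(k, 3), k)
```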

Requirements

  • CUDA 8 (so Theano can run on the GPU)
  • Python 3
  • Lasagne
  • a few other packages (installing Anaconda is the easiest way to get them)

Run

From the repository root:

First, extract the preprocessed SNLI data: ./extract_data.sh

Then run: python3 ./snli_reasoning_attention.py [condition|attention|word_by_word]

Or run: python3 ./snli_match_lstm.py

Results

The learning curves of word-by-word attention (the best test accuracy is reached at epoch 41):

Epoch 1-20: [learning-curve plot: wordbyword_attention]

Epoch 20-39: [learning-curve plot: wordbyword_attention20_39]

Epoch 40-59: [learning-curve plot: wordbyword_attention40_59]

The learning curve of match LSTM with Word2Vec word embeddings: [learning-curve plot: mlstm_word2vec_embedding]

Notes

About word-by-word attention:

  1. The test accuracy of word-by-word attention is 0.2% lower than in the original paper: 83.29% (epoch 41).
  2. The learning rate is reduced every 20 epochs; see the log files for the exact values (a minimal sketch of the schedule follows this list).
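
Lowering the rate on a fixed schedule is straightforward with a Theano shared variable passed to the Lasagne update rule (e.g. lasagne.updates.adam(loss, params, learning_rate=learning_rate)). The sketch below shows the pattern only; the initial rate and the 0.5 decay factor are assumptions, not the values recorded in the logs.

```python
import numpy as np
import theano

# Shared scalar learning rate; hand this to the Lasagne update rule.
# Initial value and decay factor are assumptions (see the log files).
learning_rate = theano.shared(np.float32(1e-3), name='learning_rate')

for epoch in range(1, 61):
    # ... one pass over the SNLI training minibatches goes here ...
    if epoch % 20 == 0:
        learning_rate.set_value(np.float32(learning_rate.get_value() * 0.5))
```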

About match LSTM:

  1. The test accuracy of match LSTM is 1% lower than in the original paper.
  2. The learning rate is not decayed during training.
  3. A pre-trained Word2Vec model supplies the word embeddings.
  4. OOV words in the training set are tuned during training, as in Reasoning About Entailment With Neural Attention (see the sketch after this list).
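
Items 3 and 4 amount to building one embedding matrix in which the pretrained Word2Vec rows stay fixed and only the rows for training-set OOV words receive gradient updates. A minimal sketch, assuming gensim for loading the vectors; the file name, init range, and helper name are hypothetical and may differ from the repository's preprocessing scripts.

```python
import numpy as np
from gensim.models import KeyedVectors

# Hypothetical file name; any word2vec-format vector file works here.
w2v = KeyedVectors.load_word2vec_format(
    'GoogleNews-vectors-negative300.bin', binary=True)

def build_embeddings(train_vocab, dim=300, seed=0):
    """Build the embedding matrix for the SNLI vocabulary.

    Words covered by Word2Vec keep their pretrained (fixed) vectors; OOV
    words get small random vectors, and those rows are the only embedding
    parameters updated during training.
    """
    rng = np.random.RandomState(seed)
    E = np.zeros((len(train_vocab), dim), dtype=np.float32)
    tuned_rows = []  # indices of OOV words, i.e. the trainable embedding rows
    for i, word in enumerate(train_vocab):
        if word in w2v:
            E[i] = w2v[word]
        else:
            E[i] = rng.uniform(-0.05, 0.05, dim)  # init range is an assumption
            tuned_rows.append(i)
    return E, tuned_rows
```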
