X-SPELLS - eXplaining Sentiment Prediction by generating ExempLars in the Latent Space

Orestis Lampridis, Aristotle University of Thessaloniki, Greece, lorestis@csd.auth.gr
Riccardo Guidotti, Department of Computer Science, University of Pisa, Italy, riccardo.guidotti@unipi.it
Salvatore Ruggieri, Department of Computer Science, University of Pisa, Italy, salvatore.ruggieri@unipi.it

The recent years have witnessed a rapid increase in the use of machine learning models in a wide range of application fields, including businesses, self-driving cars, medicine, public policy, and many others. A large part of those machine learning models are black boxes, i.e., their overall functioning and the logic behind their decisions for a given input instance are not clearly understandable to humans. We present XSPELLS, a model-agnostic local approach for explaining the decisions of a black box model for sentiment classification of short texts. The explanations provided consist of a set of exemplar sentences and a set of counter-exemplar sentences. The former are examples classified by the black box with the same label as the text to explain. The latter are examples classified with a different label (a form of counterfactuals). Both are close in meaning to the text to explain, and both are meaningful sentences, albeit they are synthetically generated. XSPELLS generates neighbours of the text to explain in a latent space using Variational Autoencoders for encoding text and decoding latent instances. A decision tree is learned from randomly generated neighbours, and used to drive the selection of the exemplars and counter-exemplars.

Keras implementation of VAE based on code in alexeyev's repo.

An improved version of X-SPELLS is available here.

License

MIT License for the source code in the lstm_vae directory.
BSD 2-Clause "Simplified" License for the source code in the lime directory.
Apache-2.0 License for the rest of the project.

Citation

O. Lampridis, R. Guidotti, S. Ruggieri. Explaining Sentiment Classification with Synthetic Exemplars and Counter-Exemplars. Discovery Science (DS 2020). 357-373. Vol. 12323 of LNCS, Springer, September 2020.

References

Bowman, S.R., Vilnis, L., Vinyals, O., Dai, A.M., Jozefowicz, R. and Bengio, S., 2015. Generating sentences from a continuous space. arXiv preprint arXiv:1511.06349.

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
data		data
experiments		experiments
lime		lime
lstm_vae		lstm_vae
models		models
.gitattributes		.gitattributes
DNN_base.py		DNN_base.py
LICENSE.md		LICENSE.md
README.md		README.md
create_explanations.py		create_explanations.py
decision_tree.py		decision_tree.py
hate_tweets_DNN.py		hate_tweets_DNN.py
hate_tweets_RF.py		hate_tweets_RF.py
polarity_tweets_DNN.py		polarity_tweets_DNN.py
polarity_tweets_RF.py		polarity_tweets_RF.py
pre_processing.py		pre_processing.py
requirements.txt		requirements.txt
train_vae.py		train_vae.py
vectorize.py		vectorize.py

License

orestislampridis/X-SPELLS

Folders and files

Latest commit

History

Repository files navigation

X-SPELLS - eXplaining Sentiment Prediction by generating ExempLars in the Latent Space

License

Citation

References

About

Topics

Resources

License

Stars

Watchers

Forks

Languages