The POLAR Framework: Polar Opposites Enable Interpretability of Pre-Trained Word Embeddings

The POLAR Framework: Polar Opposites Enable Interpretability of Pre-Trained Word Embeddings. Binny Mathew, Sandipan Sikdar, Florian Lemmerich, and Markus Strohmaier accepted at The Web Conference (WWW) 2020.

Please cite our paper in any published work that uses any of these resources.

@inproceedings{mathew2020polar,
  title={The POLAR Framework: Polar Opposites Enable Interpretability of Pre-Trained Word Embeddings},
  author={Mathew, Binny and Sikdar, Sandipan and Lemmerich, Florian and Strohmaier, Markus},
  booktitle={Proceedings of The Web Conference 2020},
  pages={1548--1558},
  year={2020}
}

Abstract

We introduce ‘POLAR’ — a framework that adds interpretability to pre-trained word embeddings via the adoption of semantic dif- ferentials. Semantic differentials are a psychometric construct for measuring the semantics of a word by analysing its position on a scale between two polar opposites (e.g., cold – hot, soft – hard). The core idea of our approach is to transform existing, pre-trained word embeddings via semantic differentials to a new “polar” space with interpretable dimensions defined by such polar opposites. Our framework also allows for selecting the most discriminative di- mensions from a set of polar dimensions provided by an oracle, i.e., an external source. We demonstrate the effectiveness of our framework by deploying it to various downstream tasks, in which our interpretable word embeddings achieve a performance that is comparable to the original word embeddings. We also show that the interpretable dimensions selected by our framework align with human judgement. Together, these results demonstrate that interpretability can be added to word embeddings without com- promising performance. Our work is relevant for researchers and engineers interested in interpreting pre-trained word embeddings.

The framework takes pre- trained word embeddings as an input and generates word embeddings with interpretable (polar) dimensions as an out- put. In this example, the embeddings are generated by ap- plying POLAR to embeddings pre-trained on Google News dataset with Word2Vec.

Requirements

Numpy
Sklearn
Tqdm
Gensim
Nltk

Running the code

Results can be obtained using the main notebook (main.ipynb)

The code is commented for ease of use

Embeddings Files

The links for the embeddings used -

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
Antonym_sets		Antonym_sets
Downstream Task		Downstream Task
POLAR.png		POLAR.png
Readme.md		Readme.md
main.ipynb		main.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Antonym_sets

Antonym_sets

Downstream Task

Downstream Task

POLAR.png

POLAR.png

Readme.md

Readme.md

main.ipynb

main.ipynb

Repository files navigation

The POLAR Framework: Polar Opposites Enable Interpretability of Pre-Trained Word Embeddings

Abstract

Requirements

Running the code

Embeddings Files

About

Releases

Packages

Contributors 2

Languages

Sandipan99/POLAR

Folders and files

Latest commit

History

Repository files navigation

The POLAR Framework: Polar Opposites Enable Interpretability of Pre-Trained Word Embeddings

Abstract

Requirements

Running the code

Embeddings Files

About

Topics

Resources

Stars

Watchers

Forks

Languages