This is a monolingual and cross-lingual word representation reading list maintained by muyeby!
- Shyam Upadhyay, Manaal Faruqui, Chris Dyer and Dan Roth. 2016. Cross-lingual Models of Word Embeddings: An Empirical Comparison. In Proceedings of ACL 2016. [BibTeX] [code]
- Ivan Vuli¢, Anders Søgaard, Manaal Faruqui. 2017. Cross-Lingual Word Representations: Induction and Evaluation. In Proceedings of EMNLP 2017.
- Anders Søgaard, Sebastian Ruder, and Ivan Vulić. 2018. On the Limitations of Unsupervised Bilingual Dictionary Induction. In Proceedings of ACL 2018. [BibTeX]
- Mareike Hartmann, Yova Kementchedjhieva and Anders Søgaard. 2018. Why is unsupervised alignment of English embeddings from different algorithms so hard?. In Proceedings of EMNLP 2018. [BibTeX]
- Sebastian Ruder, Ivan Vulić, and Anders Søgaard. 2019. A Survey Of Cross-lingual Word Embedding Models. *Journal of Artificial Intelligence Research. [BibTeX]
- Tomas Mikolov, Quoc V. Le, and Ilya Sutskever. 2013. Exploiting Similarities among Languages for Machine Translation. arxiv:1309.4168. [BibTeX]
- Manaal Faruqui and Chris Dyer. 2014 Improving vector space word representations using multilingual correlation. In Proceedings of EACL 2014. [BibTeX] [Code]
- Georgiana Dinu, Angeliki Lazaridou, and Marco Baroni. 2015. Improving Zero-shot Learning by Mitigating the Hubness Problem. In Proceedings of ICLR 2015. [BibTeX]
- Chao Xing, Dong Wang, Chao Liu, Yiye Lin. 2015. Normalized word embedding and orthogonal transform for bilingual word translation. In Proceedings of NAACL 2015. [BibTeX]
- Angeliki Lazaridou, Georgiana Dinu and Marco Baroni. 2015. Hubness and Pollution: Delving into Cross-Space Mapping for Zero-Shot Learning. In Proceedings of IJCNLP 2015. [BibTeX]
- Yuan Zhang, David Gaddy, Regina Barzilay and Tommi Jaakkola. 2016. Ten Pairs to Tag – Multilingual POS Tagging via Coarse Mapping between Embeddings. In Proceedings of NAACL 2016. [BibTeX] [Code]
- Mikel Artetxe, Gorka Labaka, and Eneko Agirre. 2016. Learning principled bilingual mappings of word embeddings while preserving monolingual invariance. In Proceedings of EMNLP 2016. [BibTex] [Code]
- Meng Zhang, Yang Liu, Huanbo Luan, Maosong Sun, Tatsuya Izuha, and Jie Hao. 2016. Building Earth Mover's Distance on Bilingual Word Embeddings for Machine Translation. In Proceedings of AAAI 2016. [BibTeX]
- Meng Zhang, Yang Liu, Huanbo Luan, Yiqun Liu, and Maosong Sun. 2016. Inducing Bilingual Lexica From Non-Parallel Data With Earth Mover's Distance Regularization. In Proceedings of COLING 2016. [BibTeX]
- Ivan Vulić and Anna Korhonen. On the Role of Seed Lexicons in Learning Bilingual Word Embeddings. In Proceedings of ACL 2016. [BibTeX]
- Ann Irvine and Chris Callison-Burch. 2017. A Comprehensive Analysis of Bilingual Lexicon Induction. Computational Linguistics. [BibTeX]
- Geert Heyman, Ivan Vulić, and Marie-Francine Moens. 2017. Bilingual Lexicon Induction by Learning to Combine Word-Level and Character-Level Representations. In Proceedings of EACL 2017. [BibTeX]
- Mikel Artetxe, Gorka Labaka, and Eneko Agirre. 2018. Generalizing and improving bilingual word embedding mappings with a multi-step framework of linear transformations. In Proceedings of AAAI 2018. [BibTex] [Code]
- Ndapa Nakashole. 2018. NORMA: Neighborhood Sensitive Maps for Multilingual Word Embeddings. In Proceedings of EMNLP 2018. [BibTeX]
- Yerai Doval, Jose Camacho-Collados, Luis Espinosa Anke, and Steven Schockaert. 2018. Improving Cross-Lingual Word Embeddings by Meeting in the Middle. In Proceedings of EMNLP 2018. [BibTeX]
- Sebastian Ruder, Ryan Cotterell, Yova Kementchedjhieva, and Anders Søgaard. 2018. A Discriminative Latent-Variable Model for Bilingual Lexicon Induction. In Proceedings of EMNLP 2018. [BibTeX] [code]
- Armand Joulin, Piotr Bojanowski, Tomas Mikolov, Hervé Jégou, and Edouard Grave. 2018. Loss in Translation: Learning Bilingual Word Mapping with a Retrieval Criterion. In Proceedings of EMNLP 2018. [BibTeX] [code]
- Samuel L. Smith, David H. P. Turban, Steven Hamblin and Nils Y. Hammerla. 2017. Offline bilingual word vectors, orthogonal transformations and the inverted softmax. In Proceedings of ICLR 2017. [BibTeX]
- Mikel Artetxe, Gorka Labaka, and Eneko Agirre. 2017. Learning Bilingual Word Embeddings with (Almost) No Bilingual Data. In Proceedings of ACL 2017. [BibTeX] [Code]
- Meng Zhang, Haoruo Peng, Yang Liu, Huanbo Luan, and Maosong Sun. 2017. Bilingual Lexicon Induction from Non-Parallel Data with Minimal Supervision. In Proceedings of AAAI 2017. [BibTeX] [Code]
- Antonio Valerio Miceli Barone. 2016. Towards cross-lingual distributed representations without parallel text trained with adversarial autoencoders. In Proceedings of Rep4NLP 2016. [BibTeX]
- Hailong Cao, Tiejun Zhao, Shu ZHANG and Yao Meng. 2016. A Distribution-based Model to Learn Bilingual Word Embeddings. In Proceedings of COLING 2016. [BibTeX]
- Meng Zhang, Yang Liu, Huanbo Luan, and Maosong Sun. 2017. Adversarial Training for Unsupervised Bilingual Lexicon Induction. In Proceedings of ACL 2017. [BibTeX] [Code]
- Bradley Hauer, Garrett Nicolai, and Grzegorz Kondrak. 2017. Bootstrapping Unsupervised Bilingual Lexicon Induction. In Proceedings of EACL 2017. [BibTeX]
- Yunsu Kim, Julian Schamper, and Hermann Ney. 2017. Unsupervised Training for Large Vocabulary Translation Using Sparse Lexicon and Word Classes. In Proceedings of EACL 2017. [BibTeX]
- Derry Tanti Wijaya, Brendan Callahan, John Hewitt, Jie Gao, Xiao Ling, Marianna Apidianaki, and Chris Callison-Burch. 2017. Learning Translations via Matrix Completion. In Proceedings of EMNLP 2017. [BibTeX]
- Meng Zhang, Yang Liu, Huanbo Luan, and Maosong Sun. 2017. Earth Mover's Distance Minimization for Unsupervised Bilingual Lexicon Induction. In Proceedings of EMNLP 2017. [BibTeX] [Code]
- Ndapandula Nakashole and Raphael Flauger. 2017. Knowledge Distillation for Bilingual Dictionary Induction. In Proceedings of EMNLP 2017. [BibTeX]
- Guillaume Lample, Alexis Conneau, Marc'Aurelio Ranzato, Ludovic Denoyer, and Hervé Jégou. 2018. Word Translation without Parallel Data. In Proceedings of ICLR 2018. [BibTeX] [Code]
- Fabienne Braune, Viktor Hangya, Tobias Eder, and Alexander Fraser. 2018. Evaluating Bilingual Word Embeddings on the Long Tail. In Proceedings of NAACL 2018. [BibTeX]
- Ndapa Nakashole and Raphael Flauger. 2018. Characterizing Departures from Linearity in Word Translation. In Proceedings of ACL 2018. [BibTeX]
- Mikel Artetxe, Gorka Labaka, and Eneko Agirre. 2018. A Robust Self-learning Method for Fully Unsupervised Cross-lingual Mappings of Word Embeddings. In Proceedings of ACL 2018. [BibTeX] [Code]
- Parker Riley and Daniel Gildea. 2018. Orthographic Features for Bilingual Lexicon Induction. In Proceedings of ACL 2018. [BibTeX]
- Hanan Aldarmaki, Mahesh Mohan and Mona Diab. 2018. Unsupervised Word Mapping Using Structural Similarities in Monolingual Embeddings. In TACL 2018. [BibTeX]
- Edouard Grave, Armand Joulin and Quentin Berthet. 2018. Unsupervised Alignment of Embeddings with Wasserstein Procrustes In arxiv. [BibTeX]
- Amir Hazem and Emmanuel Morin. 2018. Leveraging Meta-Embeddings for Bilingual Lexicon Extraction from Specialized Comparable Corpora. In Proceedings of COLING 2018. [BibTeX]
- Xilun Chen and Claire Cardie. 2018. Unsupervised Multilingual Word Embeddings. In Proceedings of EMNLP 2018. [BibTeX]
- Zi-Yi Dou, Zhi-Hao Zhou, and Shujian Huang. 2018. Unsupervised Bilingual Lexicon Induction via Latent Variable Models. In Proceedings of EMNLP 2018. [BibTeX]
- Tanmoy Mukherjee, Makoto Yamada and Timothy Hospedales. 2018. Learning Unsupervised Word Translations Without Adversaries. In Proceedings of EMNLP 2018. [BibTeX]
- Yedid Hoshen and Lior Wolf. 2018. Non-Adversarial Unsupervised Word Translation. In Proceedings of EMNLP 2018. [BibTeX] [Code]
- David Alvarez-Melis and Tommi Jaakkola. 2018. Gromov-Wasserstein Alignment of Word Embedding Spaces. In Proceedings of EMNLP 2018. [BibTeX] [Code]
- Ruochen Xu, Yiming Yang, Naoki Otani, Yuexin Wu. 2018. Unsupervised Cross-lingual Transfer of Word Embedding Spaces. In Proceedings of EMNLP 2018. [BibTeX] [Code]
- Lifu Huang1, Kyunghyun Cho, Boliang Zhang, Heng Ji and Kevin Knight. 2018. Multi-lingual Common Semantic Space Construction via Cluster-consistent Word Embedding. In Proceedings of EMNLP 2018. [BibTeX] [Code]
- Yova Kementchedjhieva, Sebastian Ruder, Ryan Cotterell and Anders Søgaard. 2018. Generalizing Procrustes Analysis for Better Bilingual Dictionary Induction. In Proceedings of CONLL 2018. [BibTeX] [Code]
- Pratik Jawanpuria, Arjun Balgovind, Anoop Kunchukuttan and Bamdev Mishra. 2019. Learning Multilingual Word Embeddings in Latent Metric Space: A Geometric Approach. In TACL 2019. [BibTeX] [Code]
- Shizhe Chen, Qin Jin, Alexander Hauptmann. 2019. [Unsupervised Bilingual Lexicon Induction from Monolingual Multimodal Data](to be added) In Proceedings of AAAI 2019. [[BibTeX](to be added)]
- Tasnim Mohiuddin and Shafiq Joty. 2019. Revisiting Adversarial Autoencoder for Unsupervised Word Translation with Cycle Consistency and Improved Training. In Proceedings of NAACL 2019. [BibTeX] [Code]
- Chunting Zhou, Xuezhe Ma, Di Wang, and Graham Neubig. 2019. Density Matching for Bilingual Word Embedding. In Proceedings of NAACL 2019. [BibTeX] [Code]
- Noa Yehezkel Lubin, Jacob Goldberger, and Yoav Goldberg. 2019. Aligning Vector-spaces with Noisy Supervised Lexicons. In Proceedings of NAACL 2019. [BibTeX] [Code]
- Tal Schuster, Ori Ram, Regina Barzilay, and Amir Globerson. 2019. Cross-Lingual Alignment of Contextual Word Embeddings, with Applications to Zero-shot Dependency Parsing. In Proceedings of NAACL 2019. [BibTeX] [Code]
- Hanan Aldarmaki and Mona Diab. 2019. Context-Aware Cross-Lingual Mapping. In Proceedings of NAACL 2019. [BibTeX] [Code]
- Yoshinari Fujinuma, Jordan Boyd-Graber, and Michael J. Paul. 2019. A Resource-Free Evaluation Metric for Cross-Lingual Word Embeddings Based on Graph Modularity. In Proceedings of ACL 2019. [BibTeX]
- Mozhi Zhang, Keyulu Xu, Ken-ichi Kawarabayashi, Stefanie Jegelka, and Jordan Boyd-Graber. 2019. Are Girls Neko or Shōjo? Cross-Lingual Alignment of Non-Isomorphic Embeddings with Iterative Normalization. In Proceedings of ACL 2019. [BibTeX]
- Aitor Ormazabal, Mikel Artetxe, Gorka Labaka, Aitor Soroa, and Eneko Agirre. 2019. Analyzing the Limitations of Cross-lingual Word Embedding Mappings. In Proceedings of ACL 2019. [BibTeX]
- Takashi Wada, Tomoharu Iwata, and Yuji Matsumoto. 2019. Unsupervised Multilingual Word Embedding with Limited Resources using Neural Language Models. In Proceedings of ACL 2019. [BibTeX]
- Pengcheng Yang, Fuli Luo, Peng Chen, Tianyu Liu, and Xu Sun. 2019. MAAM: A Morphology-Aware Alignment Model for Unsupervised Bilingual Lexicon Induction. In Proceedings of ACL 2019. [BibTeX]
- Benjamin Marie and Atsushi Fujita. 2019. Unsupervised Joint Training of Bilingual Word Embeddings. In Proceedings of ACL 2019. [BibTeX]
- Mikel Artetxe, Gorka Labaka, and Eneko Agirre. 2019. Bilingual Lexicon Induction through Unsupervised Machine Translation. In Proceedings of ACL 2019. [BibTeX]
- Barun Patra, Joel Ruben Antony Moniz, Sarthak Garg, Matthew R. Gormley and Graham Neubig. Bilingual Lexicon Induction with Semi-supervision in Non-Isometric Embedding Spaces In Proceedings of ACL 2019 [BibTeX]
- Elias Stengel-Eskin, Tzu-Ray Su, Matt Post, and Benjamin Van Durme. 2019. A Discriminative Neural Model for Cross-Lingual Word Alignment. In Proceedings of EMNLP 2019. [BibTeX]
- Ivan Vulić, Goran Glavaš, Roi Reichart, and Anna Korhonen. 2019. Do We Really Need Fully Unsupervised Cross-Lingual Embeddings?. In Proceedings of EMNLP 2019. [BibTeX]
- Paula Czarnowska, Sebastian Ruder, Edouard Grave, Ryan Cotterell, and Ann Copestake. 2019. Don't Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction. In Proceedings of EMNLP 2019. [BibTeX]
- Yova Kementchedjhieva, Mareike Hartmann, and Anders Søgaard. 2019. Lost in Evaluation: Misleading Benchmarks for Bilingual Dictionary Induction. In Proceedings of EMNLP 2019. [BibTeX]