Skip to content

hardyqr/Bilingual-Lexicon-Induction-an-incomplete-list

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 

Repository files navigation

Bilingual Lexicon Induction and Hubness Problem: an incomplete list

Conferences

Self-Tuning Spectral Clustering.
Lihi Zelnik-Manor, Pietro Perona.
(NIPS 2004)
[paper]

Learning Bilingual Lexicons from Monolingual Corpora.
Aria Haghighi, Percy Liang, Taylor Berg-Kirkpatrick, Dan Klein.
(ACL 2008)
[paper]

Nearest Neighbors in High-Dimensional Data: The Emergence and Influence of Hubs.
Milos Radovanovic, Alexandros Nanopoulos, Mirjana Ivanovic.
(ICML 2009)
[paper]

On the Existence of Obstinate Results in Vector Space Models.
Milos Radovanovic, Alexandros Nanopoulos, Mirjana Ivanovic.
(SIGIR 2010)
[paper]

Supervised Bilingual Lexicon Induction with Multiple Monolingual Signals.
Ann Irvine, Chris Callison-Burch.
(NAACL 2013)
[paper]

Normalized word embedding and orthogonaltransform for bilingual word translation.
Chao Xing, Dong Wang, Chao Liu, Yiye Lin.
(NAACL 2015)
[paper]

IMPROVING ZERO-SHOT LEARNING BY MITIGATING THE HUBNESS PROBLEM.
Georgiana Dinu, Angeliki Lazaridou, Marco Baroni.
(ICLR 2015 Workshop)
[paper]

Hubness and Pollution: Delving into Cross-Space Mapping for Zero-Shot Learning.
Angeliki Lazaridou, Georgiana Dinu, Marco Baroni.
(ACL 2015)
[paper]

Ridge Regression, Hubness, and Zero-Shot Learning.
Yutaro Shigeto, Ikumi Suzuki, Kazuo Hara, Masashi Shimbo, Yuji Matsumoto.
(ECML-PKDD 2015)
[paper]

A distribution-based model to learn bilingual word embeddings.
Hailong Cao, Tiejun Zhao, Shu Zhang, Yao Meng.
(COLING 2016)
[paper]

Learning principled bilingual mappings of word embeddings while preserving monolingual invariance.
Mikel Artetxe, Gorka Labaka, Eneko Agirre.
(EMNLP 2016)
[paper]

Adversarial Training for Unsupervised Bilingual Lexicon Induction.
Meng Zhang, Yang Liu, Huanbo Luan, Maosong Sun.
(ACL 2017)
[paper] [code]

Learning bilingual word embeddings with (almost) no bilingual data.
Mikel Artetxe, Gorka Labaka, Eneko Agirre.
(NIPS 2004)
[paper]

Earth Mover's Distance Minimization for Unsupervised Bilingual Lexicon Induction.
Meng Zhang, Yang Liu, Huanbo Luan, Maosong Sun.
(EMNLP 2017)
[paper]

OFFLINE BILINGUAL WORD VECTORS, ORTHOGONAL TRANSFORMATIONS AND THE INVERTED SOFTMAX.
Samuel L. Smith, David H. P. Turban, Steven Hamblin, Nils Y. Hammerla.
(ICLR 2017)
[paper] [code]

WORD TRANSLATION WITHOUT PARALLEL DATA.
Alexis Conneau, Guillaume Lample, Marc’Aurelio Ranzato, Ludovic Denoyer, Herve Jégou.
(ICLR 2018)
[paper] [MUSE github] [fastText github]

A robust self-learning method for fully unsupervised cross-lingual mappings of word embeddings.
Mikel Artetxe, Gorka Labaka, Eneko Agirre.
(ACL 2018)
[paper]

On the limitations of unsupervised bilingual dictionary induction.
Anders Søgaard, Sebastian Ruder, Ivan Vulić.
(ACL 2018)
[paper]

Bridging languages through images with deep partialcanonical correlation analysis.
Guy Rotman, Ivan Vulić, Roi Reichart.
(ACL 2018)
[paper]

A Discriminative Latent-Variable Model for Bilingual Lexicon Induction.
Sebastian Ruder, Ryan Cotterell, Yova Kementchedjhieva, Anders Søgaard.
(EMNLP 2018)
[paper]

Loss in Translation: Learning Bilingual Word Mapping with a Retrieval Criterion.
Armand Joulin, Piotr Bojanowski, Tomas Mikolov, Herve Jégou, Edouard Grave.
(EMNLP 2018)
[paper] [fastText github]

Unsupervised Bilingual Lexicon Induction via Latent Variable Models.
Zi-Yi Dou, Zhi-Hao Zhou, Shujian Huang.
(EMNLP 2018)
[paper]

Unsupervised Multilingual Word Embeddings.
Xilun Chen, Claire Cardie.
(EMNLP 2018)
[paper] [code]

Gromov-Wasserstein Alignment of Word Embedding Spaces.
David Alvarez-Melis, Tommi S. Jaakkola.
(EMNLP 2018)
[paper]

Learning Translations via Images with a Massively Multilingual Image Dataset. John Hewitt, Daphne Ippolito, Brendan Callahan, Reno Kriz, Derry Tanti Wijaya, Chris Callison-Burch.
(EMNLP 2018)
[paper]

Density Matching for Bilingual Word Embedding.
Chunting Zhou, Xuezhe Ma, Di Wang, Graham Neubig.
(NAACL 2019)
[paper] [code]

Learning Unsupervised Multilingual Word Embeddings with Incremental Multilingual Hubs.
Geert Heyman, Bregt Verreet, Ivan Vulić, Marie-Francine Moens.
(NAACL 2019)
[paper]

Aligning Vector-spaces with Noisy Supervised Lexicons.
Noa Yehezkel Lubin, Jacob Goldberger, Yoav Goldberg.
(NAACL 2019)
[paper] [code]

Unsupervised Alignment of Embeddings with Wasserstein Procrustes.
Edouard Grave, Armand Joulin, Quentin Berthet.
(AISTATS 2019)
[paper]

Learning Generative Models across Incomparable Spaces.
Charlotte Bunne, David Alvarez-Melis, Andreas Krause, Stefanie Jegelka.
(ICML 2019)
[paper] [slides] [code]

Bilingual Lexicon Induction with Semi-supervision in Non-Isometric Embedding Spaces.
Barun Patra, Joel Ruben Antony Moniz, Sarthak Garg, Matthew R. Gormley, Graham Neubig.
(ACL 2019)
[paper] [code]

How to (properly) evaluate cross-lingual word embeddings: On strong baselines, comparative analyses, and some misconceptions.
Goran Glavaš, Robert Litschko, Sebastian Ruder, Ivan Vulić.
(ACL 2019)
[paper]

Don't Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction.
Paula Czarnowska, Sebastian Ruder, Edouard Grave, Ryan Cotterell, Ann Copestake.
(EMNLP 2019)
[paper]

Do We Really Need Fully Unsupervised Cross-Lingual Embeddings?
Ivan Vulić, Goran Glavaš, Roi Reichart, Anna Korhonen.
(EMNLP 2019)
[paper]

HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs.
Fangyu Liu, Rongtian Ye, Xun Wang, Shuaipeng Li.
(AAAI 2020)
[paper] [code]

CROSS-LINGUAL ALIGNMENT VS JOINT TRAINING: A COMPARATIVE STUDY AND A SIMPLE UNIFIED FRAMEWORK.
Zirui Wang, Jiateng Xie, Ruochen Xu, Yiming Yang, Graham Neubig, Jaime Carbonell.
(ICLR 2020)
[paper]

Visual Grounding in Video for Unsupervised Word Translation.
Gunnar A. Sigurdsson, Jean-Baptiste Alayrac, Aida Nematzadeh, Lucas Smaira, Mateusz Malinowski, João Carreira, Phil Blunsom, Andrew Zisserman.
(CVPR 2020)
[paper] [code]

A Graph-based Coarse-to-fine Method for Unsupervised Bilingual Lexicon Induction.
Shuo Ren, Shujie Liu, Ming Zhou, Shuai Ma.
(ACL 2020)
[paper]

A Graph-based Coarse-to-fine Method for Unsupervised Bilingual Lexicon Induction.
Shuo Ren, Shujie Liu, Ming Zhou, Shuai Ma.
(ACL 2020)
[paper]

Non-Linear Instance-Based Cross-Lingual Mapping for Non-Isomorphic Embedding Spaces.
Goran Glavaš, Ivan Vulić.
(ACL 2020)
[paper]

Classification-Based Self-Learning for Weakly Supervised Bilingual Lexicon Induction.
Mladen Karan, Ivan Vulić, Anna Korhonen, Goran Glavaš.
(ACL 2020)
[paper]

Journals

Hubs in Space: Popular Nearest Neighbors in High-Dimensional Data.
Milos Radovanovic, Alexandros Nanopoulos, Mirjana Ivanovic.
(JMLR 2010)
[paper]

Accurate image search using the contextual dissimilarity measure.
Hervé Jégou, Cordelia Schmid, Hedi Harzallah, Jakob Verbeek.
(IPAMI 2011)
[paper]

Local and Global Scaling Reduce Hubs in Space.
Dominik Schnitzer, Arthur Flexer, Markus Schedl, Gerhard Widmer.
(JMLR 2012)
[paper] [code]

Learning multilingual word embeddings in latent metric space: a geometric approach.
Pratik Jawanpuria, Arjun Balgovind, Anoop Kunchukuttan, Bamdev Mishra.
(TACL 2019)
[paper]

A Survey of Cross-lingual Word Embedding Models.
Sebastian Ruder, Ivan Vulić, Anders Søgaard.
(JAIR 2019)
[paper]