How Familiar Does That Sound? 👂 💬

Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings ☕

BlackboxNLP 2021 ⬛ 🤖

This is the code base to train different models acoustic word embeddings (AWEs), evaluation scripts, and the representational similarity analysis (RSA) experiments reported in our BlackboxNLP paper in EMNLP 2021.

📝 How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings

Dependencies 🧬

python 3.8, pytorch 1.1, numpy, scipy, faiss, pickle, pandas, yaml

Speech Data 💬 🗨️

The data in our study is drawn from the Multilingual GlobalPhone read speech database. We experiment with seven Indo-European languages: Czech 🇨🇿, Polish 🇵🇱, Russian 🇷🇺, Bulgarian 🇧🇬, Brazilian Portuguese 🇧🇷, French 🇫🇷, and German 🇩🇪. Because the data is distributed under a research license by Appen Butler Hill Pty Ltd., we cannot re-distribute the raw speech data. However, if you have already access to the GlobalPhone speech database and you would like to access to our word-alignment annotations, train/test splits, and word-level IPA transcriptions, please contact the first author.

Working with the code 🐍

To run a training experiment, write down all hyperparameters and other info in the config file config_file_train_awe_bigru_seq2seq.yml

Then ...

>>> cd xRSA-AWEs
>>> python nn_train_seq2seq_embeddings.py config_files/config_file_train_awe_bigru_seq2seq.yml

To conduct the RSA using CKA on a pre-trained AWE model, make sure the paths for the different pre-trained models are in this config file config_file_eval_rsa_bigru_seq2seq_cs.yml

Then ...

>>> cd xRSA-AWEs
>>> python nn_eval/nn_eval_seq2seq_embeddings_RSA.py config_file_eval_rsa_bigru_seq2seq_cs.yml

The code is fairly documented and the vectorization logic, as well as the code for the models, should be useful for other speech technology tasks. If you use our code and encounter problems, please create an issue or contact the first author.

If you use our code in your work or get inspired by the ideas presented in our paper, please cite our paper as

@inproceedings{Abdullah2021HowFD,
  title={How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings},
  author={Badr M. Abdullah and Iuliia Zaitova and Tania Avgustinova and Bend Möbius and Dietrich Klakow},
  booktitle={Proc. BlackboxNLP},
  year={2021}
}

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
config_files		config_files
metadata		metadata
CKA.py		CKA.py
README.md		README.md
nn_eval_CAE_embeddings_RSA.py		nn_eval_CAE_embeddings_RSA.py
nn_eval_contrastive_embeddings_RSA.py		nn_eval_contrastive_embeddings_RSA.py
nn_eval_cross_model_embeddings_RSA.py		nn_eval_cross_model_embeddings_RSA.py
nn_eval_seq2seq_embeddings_RSA.py		nn_eval_seq2seq_embeddings_RSA.py
nn_speech_models.py		nn_speech_models.py
nn_train_CAE_embeddings.py		nn_train_CAE_embeddings.py
nn_train_auto_encoder_embeddings.py		nn_train_auto_encoder_embeddings.py
nn_train_contrastive_embeddings.py		nn_train_contrastive_embeddings.py
nn_train_seq2seq_embeddings.py		nn_train_seq2seq_embeddings.py
train_utils.py		train_utils.py

uds-lsv/xRSA-AWEs

Folders and files

Latest commit

History

Repository files navigation

How Familiar Does That Sound? 👂 💬

Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings ☕

BlackboxNLP 2021 ⬛ 🤖

Dependencies 🧬

Speech Data 💬 🗨️

Working with the code 🐍

About

Resources

Stars

Watchers

Forks

Languages