This project uses attention methods to build a source separation and speaker identification pipeline. It outperforms state-of-the-art methods in single-channel mixture speaker identification by far.
Dataset is Hub-4.
Paper accepted interspeech 2020 for presentation. Arxiv link: https://arxiv.org/abs/2005.11408