MeLT: Message-Level Transformer

This repository contains code for our EMNLP 2021 Findings paper MeLT: Message-Level Transformer with Masked Document Representations as Pre-Training for Stance Detection. This repo is actively maintained; if you have questions, feel free to email the authors or open an issue on GitHub.

About

This work proposes a hierarchical transformer, built on top of DistilBERT, that directly encodes sequences of messages within a user-level context. The hierarchical transformer is pre-trained with a masked-language-modeling-style objective applied to sequences of aggregated message vectors, which turns the task into masked document modeling trained via a reconstruction loss. The pre-trained transformer is then applied to the downstream task of stance prediction using the SemEval-2016 Task 6 data. The pre-training dataset is not open sourced.
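For intuition, here is a minimal sketch of the masked document modeling objective described above. All names (d_model, mask_prob, message_transformer) are illustrative assumptions, not the repo's actual identifiers; the real implementation lives in the modeling directory.

```python
import torch
import torch.nn as nn

# Illustrative only: each "token" is a pooled message vector (e.g., from
# DistilBERT); some positions are masked and the model is trained to
# reconstruct them with a reconstruction (MSE) loss.
d_model, mask_prob = 768, 0.15

encoder_layer = nn.TransformerEncoderLayer(d_model=d_model, nhead=8)
message_transformer = nn.TransformerEncoder(encoder_layer, num_layers=2)
mask_vector = nn.Parameter(torch.zeros(d_model))  # learned mask embedding

def masked_document_loss(message_vecs):
    """message_vecs: (seq_len, batch, d_model) pooled message embeddings."""
    mask = torch.rand(message_vecs.shape[:2]) < mask_prob  # messages to hide
    corrupted = message_vecs.clone()
    corrupted[mask] = mask_vector                   # replace with mask embedding
    reconstructed = message_transformer(corrupted)  # contextual re-encoding
    # reconstruction loss computed only on the masked positions
    return nn.functional.mse_loss(reconstructed[mask], message_vecs[mask])
```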

Repo Structure

The code in this repository is used to construct MeLT and pre-train it on the masked document modeling task. Fine-tuning code is not supplied, as the novelty of the paper lies in the construction and setup of MeLT itself. If there is significant interest in the fine-tuning code, I'll add it to this repository, but applying MeLT should be much less complex than pre-training it.

The modeling directory stores the class files defining the necessary helper functions, transformer layers, attention calculations, and the MeLT model itself; these are located in neural.py, encoder_layers.py, attn.py, and encoder.py, respectively. There is also a data_handler.py, which loads the raw language data (from MySQL) and builds a PyTorch dataloader batched by users.
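As a rough sketch of what "batched by users" means here, each dataset item can be thought of as one user's padded message history. The class and field names below are hypothetical, not those in data_handler.py.

```python
import torch
from torch.utils.data import Dataset, DataLoader

class UserMessageDataset(Dataset):
    """Illustrative dataset: one item = one user's message-vector history."""

    def __init__(self, user_histories, max_msgs=64, d_model=768):
        # user_histories: list of (num_msgs, d_model) tensors, one per user
        self.histories = user_histories
        self.max_msgs = max_msgs
        self.d_model = d_model

    def __len__(self):
        return len(self.histories)

    def __getitem__(self, idx):
        msgs = self.histories[idx][: self.max_msgs]
        pad = self.max_msgs - msgs.size(0)
        padding_mask = torch.zeros(self.max_msgs, dtype=torch.bool)
        if pad > 0:
            msgs = torch.cat([msgs, torch.zeros(pad, self.d_model)])
            padding_mask[-pad:] = True  # mark padded message slots
        return msgs, padding_mask

# Each batch then holds several users' histories plus their padding masks.
loader = DataLoader(UserMessageDataset([torch.randn(10, 768)]), batch_size=8)
```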

The root directory has a main.py, which loads the MeLT model and controls the training, testing, and hyperparameter tuning modes.
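Given the PyTorch Lightning dependency noted below, a main.py like this one typically dispatches modes to a Lightning Trainer. The snippet is an assumption about the general shape, not the repo's actual entry point; `run` and `mode` are hypothetical names.

```python
import pytorch_lightning as pl

def run(model: pl.LightningModule, mode: str = "train"):
    # PL 0.7.x-style Trainer configuration
    trainer = pl.Trainer(max_epochs=10, gpus=1)
    if mode == "train":
        trainer.fit(model)    # uses the module's train_dataloader()
    elif mode == "test":
        trainer.test(model)   # uses the module's test_dataloader()
```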

Requirements

An environment.yml file is included that captures the conda environment used to develop this project. At a high level, you will need PyTorch (1.4), PyTorch Lightning (0.7.5), Pandas, and the standard NumPy stack.

Cite

@inproceedings{matero-etal-2021-melt-message,
    title = "{M}e{LT}: Message-Level Transformer with Masked Document Representations as Pre-Training for Stance Detection",
    author = "Matero, Matthew  and
      Soni, Nikita  and
      Balasubramanian, Niranjan  and
      Schwartz, H. Andrew",
    booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2021",
    month = nov,
    year = "2021",
    address = "Punta Cana, Dominican Republic",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.findings-emnlp.253",
    pages = "2959--2966",
}
