
AI CUP 2022: Argument Detection

Task Description

Input data

  • q: An English discourse.
  • r: An English text that responds to q.
  • s: The discussion relationship between r and q.

Output data

  • q′ & r′: Subsequences of q and r, respectively. q′ and r′ provide enough key information to judge that the relationship between q and r is s.
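For illustration, a hypothetical input/output pair might look like this. The field names follow the task description above; the text and the relationship label are made up, and the official competition file format may differ:

```python
# Illustrative example only -- not the official AI CUP data format.
example = {
    "q": "Social media does more harm than good. It spreads misinformation quickly.",
    "r": "I disagree. It also lets people fact-check claims in real time.",
    "s": "DISAGREE",  # discussion relationship between r and q (label value assumed)
}

# The expected output: subsequences of q and r that carry enough
# key information to justify the relationship s.
expected = {
    "q'": "Social media does more harm than good.",
    "r'": "I disagree.",
}
```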

See the full task description here.

Introduction

  • We redefine this task as an extractive summarization task, which can also be regarded as a sentence classification task. By calculating a score for each sentence, we decompose q and r into sentences and recompose the high-scoring sentences into the arguments q′ and r′.
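The decompose-score-recompose idea can be sketched as follows. This is a toy stand-in, not the repository's code: the scorer here is a dummy function in place of the trained model, and a naive regex split stands in for proper sentence tokenization (presumably done with nltk, which appears in the requirements):

```python
import re

def extract_argument(text, score_sentence, threshold=0.5):
    """Decompose text into sentences, keep those scoring above the
    threshold, and recompose them into the extracted argument."""
    # Naive sentence split; the notebooks would use a real tokenizer instead.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text) if s]
    kept = [s for s in sentences if score_sentence(s) > threshold]
    return " ".join(kept)

# Toy scorer: pretend longer sentences carry more information.
toy_scorer = lambda s: min(len(s.split()) / 10, 1.0)

result = extract_argument(
    "Short one. This sentence is quite a bit longer and carries more detail.",
    toy_scorer,
)
```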

Core Model Structure


  • We build two input sequences from q and r. Since BERT's maximum input length is 512 tokens, we first summarize q and r down to at most 450 tokens each, using the pretrained bert-extractive-summarizer library. We then feed the sequences into BERT encoders and take their pooler outputs sq and sr, concatenating them with their inner product sq*sr. Finally, we pass the concatenated vector through dense layers to obtain the score. Besides the main sequence classification task, we also design a co-task that classifies the relationship s to help training.
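The scoring head described above can be sketched as follows. This is a sketch, not the repository's actual code: the BERT encoders are omitted (stand-in: precomputed pooler vectors), the hidden size 768 matches bert-base, sq*sr is interpreted as an elementwise product, and the number of relationship classes is an assumption:

```python
import torch
import torch.nn as nn

class PairScorer(nn.Module):
    """Concat(s_q, s_r, s_q*s_r) -> dense layers -> score,
    plus an auxiliary head for the relationship co-task."""

    def __init__(self, hidden=768, num_relations=2):  # num_relations assumed
        super().__init__()
        self.score_head = nn.Sequential(
            nn.Linear(hidden * 3, hidden),
            nn.ReLU(),
            nn.Linear(hidden, 1),
        )
        self.relation_head = nn.Linear(hidden * 3, num_relations)

    def forward(self, s_q, s_r):
        # Elementwise product as the "inner product" feature.
        feats = torch.cat([s_q, s_r, s_q * s_r], dim=-1)
        return self.score_head(feats).squeeze(-1), self.relation_head(feats)

# Stand-in pooler outputs for a batch of 2 (normally produced by BERT).
s_q, s_r = torch.randn(2, 768), torch.randn(2, 768)
score, rel_logits = PairScorer()(s_q, s_r)
```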

Usage

Requirements

pip install -q torch pytorch-lightning
pip install -q transformers
pip install -q bert-extractive-summarizer
pip install -q nltk==3.7

Model Path

https://drive.google.com/file/d/1-QVxqGodzD0FEQNVOqDsWsgHfu9fkHwO/view?usp=share_link

Train

(Remember to change all file paths to your own!)

  1. Download train data here.
  2. Run utils/preprocessing.ipynb for data preprocessing. You may also just use the processed data here.
  3. Run train.ipynb to start training.
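The co-task mentioned in the model description suggests training with a combined objective. A minimal sketch of such a loss, under stated assumptions (the loss functions and the 0.5 auxiliary weight are illustrative choices, not taken from train.ipynb):

```python
import torch
import torch.nn as nn

bce = nn.BCEWithLogitsLoss()  # main task: is this sentence part of the argument?
ce = nn.CrossEntropyLoss()    # co-task: classify the relationship s

def combined_loss(score_logits, sent_labels, rel_logits, rel_labels, aux_weight=0.5):
    """Main sentence-scoring loss plus a weighted auxiliary relationship loss."""
    return bce(score_logits, sent_labels.float()) + aux_weight * ce(rel_logits, rel_labels)

# Dummy batch of 4 sentences with 2 relationship classes.
loss = combined_loss(
    torch.randn(4), torch.tensor([1, 0, 1, 0]),
    torch.randn(4, 2), torch.tensor([0, 1, 1, 0]),
)
```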

Predict

(Remember to change all file paths to your own!)

  1. Download test data here.
  2. Download pretrained model here or from link above.
  3. Run predict.ipynb to start predicting.

Final Scores

Our best answer is here.

  • Public Score: 0.815819
  • Private Score: 0.867684
