Controlled-Dialogue-Generation

This repository contains the data and code for the paper "SideControl: Controlled Open-domain Dialogue Generation via Additive Side Networks" (EMNLP2021-Findings).

Environment

Under the server environment of python=3.6 and CUDA 11.1, install the following packages:

pip install -r requirements.txt

Training

Training the DialoGPT-SideNet on DailyDialog full training set includes two steps. First, pretrain a DA classifier:

python gpt2_da_sidenet.py -d dailydialog_raw -t train -f base --pretrain_clf

Second, train a SideNet (remember to replace the timestamp and ckpt with your own model checkpoint):

python gpt2_da_sidenet.py -d dailydialog_raw -t ft -f sidenet --timestamp 2021-05-11-07-08-10 --ckpt 2500

Training the DialoGPT-SideNet on ConvAI2 full training set includes one step:

python gpt2_kb_sidenet.py -d convai2_raw -t train -f sidenet

Decoding

Get the DialoGPT-SideNet predictions on DailyDialog full testing set (remember to replace the timestamp and ckpt with your own model checkpoint):

python gpt2_da_sidenet.py -d dailydialog_raw -t eval -f sidenet --timestamp 2021-05-11-07-08-10 --ckpt 2500

Get the DialoGPT-SideNet predictions on ConvAI2 full testing set (remember to replace the timestamp and ckpt with your own model checkpoint):

python gpt2_kb_sidenet.py -d convai2_raw -t eval -f sidenet --timestamp 2021-04-26-10-21-06 --ckpt 47839

Evaluation

Compute text quality metrics for DialoGPT-SideNet predictions:

python evaluation.py \
--mode sent \
--reference_file {args.model_name}_{args.dataset}_{args.flag}_{args.timestamp}/refs.json \
--output_file {args.model_name}_{args.dataset}_{args.flag}_{args.timestamp}/outs.json

Compute text controllability metrics (knowledge document control) for DialoGPT-SideNet predictions:

python evaluation.py \
--mode kb \
--reference_file {args.model_name}_{args.dataset}_{args.flag}_{args.timestamp}/refs.json \
--output_file {args.model_name}_{args.dataset}_{args.flag}_{args.timestamp}/outs.json

Compute text controllability metrics (semantic label control) for DialoGPT-SideNet predictions. First, train an independent DA classifier:

python bert_da_eval.py -d dailydialog_dis -t train -f clf

Second, compute the accuracy predicted by the independent DA classifier (remember to replace the timestamp, ckpt and output_file accordingly):

python bert_da_eval.py -d dailydialog_dis -t pred -f clf \
--timestamp 2021-04-11-05-57-18 --ckpt 10000 \
--output_file {args.model_name}_{args.dataset}_{args.flag}_{args.timestamp}/outs.json

Citation

Please cite our work if you are interested.

@inproceedings{du-ji-2021-sidecontrol-controlled,
    title = "{S}ide{C}ontrol: Controlled Open-domain Dialogue Generation via Additive Side Networks",
    author = "Du, Wanyu  and
      Ji, Yangfeng",
    booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2021",
    month = nov,
    year = "2021",
    address = "Punta Cana, Dominican Republic",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.findings-emnlp.188",
    pages = "2175--2194",
}

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
data		data
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

src

src

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

requirements.txt

requirements.txt

Repository files navigation

Controlled-Dialogue-Generation

Environment

Training

Decoding

Evaluation

Citation

About

Releases

Packages

Languages

License

wyu-du/Controlled-Dialogue-Generation

Folders and files

Latest commit

History

Repository files navigation

Controlled-Dialogue-Generation

Environment

Training

Decoding

Evaluation

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Languages