
IDK: Dialogue Relevance


This repository contains the code required to replicate the experiments and data processing for the paper "Relevance in Dialogue: Is Less More? An Empirical Comparison of Existing Metrics, and a Novel Simple Metric".

All code is written for Python 3.6.9; the necessary libraries are listed in requirements.txt.
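
For example, a typical environment setup might look like the following. This is a minimal sketch assuming a Unix-like shell; the virtual-environment name is arbitrary:

```bash
# Create and activate a fresh virtual environment (the name "venv" is arbitrary)
python3 -m venv venv
source venv/bin/activate

# Install the pinned dependencies
pip install -r requirements.txt
```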

Instructions

For reproducing the IDK and COS-BERT experiments

  1. Delete the existing exp folder if you wish to reproduce the experiments from scratch. Otherwise, you can skip directly to steps 6, 8, and the second half of step 9. The full command sequence for steps 3-10 is sketched after this list.
  2. Download files outlined in data subfolder's README
  3. Run preprocess.py
  4. To reproduce all BERT coherence experiments (i.e., COS-BERT), run run_bert_coherence.sh and check the created subfolders under exp. Note that no repetitions are used: the pretrained BERT is fixed, so the outputs are deterministic.
  5. To train models and get validation set performance: python3 -u run_triplet.py | tee log_run_triplet.txt
  6. To view validation results: python3 display_val_results2.py
  7. To evaluate on the test set: python3 test_triplet.py
  8. To view test results: python3 display_test_results.py
  9. To create the plots from the paper: first save the model's predictions using python3 test_triplet_save_examples.py, then create the plots with python3 create_plots.py
  10. To reproduce the experiment that uses the pretrained BERT NSP predictor directly for relevance prediction, run runk_bert_nsp.sh
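
As a convenience, the steps above can be chained as follows. This is a minimal sketch, assuming the data from step 2 is already in place, that every command is run from the repository root, and that the shell scripts are invoked with bash:

```bash
python3 preprocess.py                                 # step 3: preprocess the downloaded data
bash run_bert_coherence.sh                            # step 4: COS-BERT experiments (results under exp/)
python3 -u run_triplet.py | tee log_run_triplet.txt   # step 5: train models, log validation performance
python3 display_val_results2.py                       # step 6: view validation results
python3 test_triplet.py                               # step 7: evaluate on the test set
python3 display_test_results.py                       # step 8: view test results
python3 test_triplet_save_examples.py                 # step 9a: save the model's predictions
python3 create_plots.py                               # step 9b: create the plots from the paper
bash runk_bert_nsp.sh                                 # step 10: pretrained BERT NSP predictor
```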

For reproducing experiments using prior metrics (i.e., BERT-NUP, COS-FT, NORM-PROB)

After downloading the files outlined in the data folder's README and running preprocess.py, change directory into the data folder and run csv_ify.py. This will create the csv folder containing the data in CSV format. Then change directory into the data/lines directory and run all of the Python scripts there; this will convert the data into a single-value-per-line format.
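
Concretely, the conversion might look like the following. This is a sketch assuming you start from the repository root; the loop simply runs every script in data/lines, as no particular order is prescribed:

```bash
cd data
python3 csv_ify.py         # creates the csv folder (data in CSV format)
cd lines
for script in *.py; do     # run every conversion script in data/lines
    python3 "$script"      # emits the single-value-per-line files
done
```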

Next, go to the external_projects subdirectory you are interested in, check the corresponding README, check out the corresponding project, and replace the files / follow the instructions provided.

The code to reproduce the GRADE, DynaEval, and FED metrics (the latter covering FED-REL and FED-COR) lives together, as all are a joint modification of the codebase at https://github.com/exe1023/DialEvalMetrics.
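
For example, checking out that shared codebase might look like the following sketch; cloning directly into external_projects is an assumption here, so defer to the corresponding README:

```bash
cd external_projects
# GRADE, DynaEval, FED-REL, and FED-COR are all built on this codebase
git clone https://github.com/exe1023/DialEvalMetrics
# Then replace the files / follow the instructions in the corresponding README
```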

For reproducing BERT NSP masking experiments

Run histogram/test_nsp_DATASET_all.py for the dataset of interest. These files use the mask stored in histogram/mask_7_humod_idk_l1_bce.pt; to regenerate this file from the provided checkpoints, run histogram/make_hist.py from the root directory.
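
For example (a sketch assuming both commands are run from the repository root; humod is an illustrative value of DATASET, inferred from the mask filename, so check the histogram folder for the actual script names):

```bash
# Regenerate the mask from the provided checkpoints (run from the repo root)
python3 histogram/make_hist.py

# Run the masking experiment for one dataset (here: humod)
python3 histogram/test_nsp_humod_all.py
```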
