
EMNLP 2017 submission

This repository contains the dataset and statistical analysis code released with the EMNLP 2017 paper "Why We Need New Evaluation Metrics for NLG".

File descriptions:

  • emnlp_data_individual_hum_scores.csv - the dataset with system outputs and evaluation ratings from 3 crowd-workers for each output
  • emnlp_data.csv - the dataset with system outputs, original human references, scores of automatic metrics and medians of human ratings
  • analysis_emnlp.R - R code with statistical analysis discussed in the paper
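
As a starting point, the snippet below is a minimal sketch of loading the per-worker ratings in R (the language of analysis_emnlp.R) and aggregating them to medians per output. The column names used here (output_id, informativeness) are illustrative assumptions, not taken from the paper; check the CSV header for the actual names before running it.

```r
# Load the individual crowd-worker ratings released with the paper
ratings <- read.csv("emnlp_data_individual_hum_scores.csv", stringsAsFactors = FALSE)
str(ratings)  # inspect which columns are actually present

# Median of the 3 crowd-worker ratings for each output
# (output_id and informativeness are hypothetical column names)
medians <- aggregate(informativeness ~ output_id, data = ratings, FUN = median)
head(medians)
```

The full analysis reported in the paper is in analysis_emnlp.R, which can be run from an R session with the CSV files in the working directory.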

Citing the paper:

Jekaterina Novikova, Ondrej Dusek, Amanda Cercas-Curry and Verena Rieser (2017): Why We Need New Evaluation Metrics for NLG. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2017), Copenhagen, Denmark.
