Wikidata2Wikipedia: Learning to Generate Wikipedia Summaries for Underserved Languages from Wikidata
This repository contains the code along with the datasets of the work that has been presented as a research paper in NAACL HLT 2018. For a detailed description of the work presented in this repository, please refer to the preprint version of the accepted paper at: https://arxiv.org/abs/1803.07116.
In a Unix shell environment execute: sh download_datasets.sh in order to download and uncompress both the datasets in their corresponding folders (i.e. ar
and eo
).
Please cite the following paper should you use this repository in your work.
@InProceedings{N18-2101,
author = "Kaffee, Lucie-Aim{\'e}e
and Elsahar, Hady
and Vougiouklis, Pavlos
and Gravier, Christophe
and Laforest, Frederique
and Hare, Jonathon
and Simperl, Elena",
title = "Learning to Generate Wikipedia Summaries for Underserved Languages from Wikidata",
booktitle = "Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)",
year = "2018",
publisher = "Association for Computational Linguistics",
pages = "640--645",
location = "New Orleans, Louisiana",
url = "http://aclweb.org/anthology/N18-2101"
}
This project is licensed under the terms of the Apache 2.0 License.