Wikidata2Wikipedia: Learning to Generate Wikipedia Summaries for Underserved Languages from Wikidata

This repository contains the code and datasets for the work presented as a research paper at NAACL HLT 2018. For a detailed description of this work, please refer to the preprint of the accepted paper: https://arxiv.org/abs/1803.07116.

Datasets

In a Unix shell environment, run sh download_datasets.sh to download and uncompress both datasets into their corresponding folders (i.e. ar and eo).
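
Once the download script has finished, a quick check can confirm that both dataset folders are in place. The following is a minimal sketch, assuming the script creates the ar and eo folders relative to the directory it is run from (this README does not state that explicitly). The helper name missing_datasets is hypothetical and not part of this repository.

```python
from pathlib import Path

# Folder names taken from this README; their location relative to the
# working directory is an assumption.
EXPECTED_FOLDERS = ("ar", "eo")


def missing_datasets(root="."):
    """Return the expected dataset folders that are absent or empty under root."""
    root = Path(root)
    missing = []
    for name in EXPECTED_FOLDERS:
        folder = root / name
        # A folder counts as missing if it does not exist or contains nothing.
        if not folder.is_dir() or not any(folder.iterdir()):
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = missing_datasets()
    if missing:
        print("Missing or empty dataset folders:", ", ".join(missing))
    else:
        print("Both dataset folders are present.")
```

If the check reports a missing folder, re-running the download script from the repository root is the first thing to try.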

BibTeX

Please cite the following paper if you use this repository in your work.

@InProceedings{N18-2101,
  author = 	"Kaffee, Lucie-Aim{\'e}e
		and Elsahar, Hady
		and Vougiouklis, Pavlos
		and Gravier, Christophe
		and Laforest, Frederique
		and Hare, Jonathon
		and Simperl, Elena",
  title = 	"Learning to Generate Wikipedia Summaries for Underserved Languages from Wikidata",
  booktitle = 	"Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)",
  year = 	"2018",
  publisher = 	"Association for Computational Linguistics",
  pages = 	"640--645",
  location = 	"New Orleans, Louisiana",
  url = 	"http://aclweb.org/anthology/N18-2101"
}

License

This project is licensed under the terms of the Apache 2.0 License.