Skip to content

HLTCHKUST/Xpersona

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

XPersona: Evaluating Multilingual Personalized Chatbot

License: MIT

This is the source code of the paper:

XPersona: Evaluating Multilingual Personalized Chatbot. [PDF]

The paper has been presented at NLP4ConvAI at EMNLP 2021 (Honorable Mention Paper). This code has been written using PyTorch. If you use source codes or datasets included in this toolkit in your work, please cite the following papers:

XPersona

@inproceedings{lin2021xpersona,
  title={XPersona: Evaluating Multilingual Personalized Chatbot},
  author={Lin, Zhaojiang and Liu, Zihan and Winata, Genta Indra and Cahyawijaya, Samuel and Madotto, Andrea and Bang, Yejin and Ishii, Etsuko and Fung, Pascale},
  booktitle={Proceedings of the 3rd Workshop on Natural Language Processing for Conversational AI},
  pages={102--112},
  year={2021}
}

English PersonaChat

@article{zhang2018personalizing,
  title={Personalizing Dialogue Agents: I have a dog, do you have pets too?},
  author={Zhang, Saizheng and Dinan, Emily and Urbanek, Jack and Szlam, Arthur and Kiela, Douwe and Weston, Jason},
  journal={arXiv preprint arXiv:1801.07243},
  year={2018}
}

Dataset

XPersona dataset is an extension of the persona-chat dataset. Specifically, we extend the ConvAI2 to the other six languages: Chinese, French, Indonesian, Italian, Korean, and Japanese.

Baselines

In this work, we provided multilingual and crosslingual trained baselines. See multilingual and crosslingual folder for more details.

Acknowledgement

This repository is implemented using Huggingface codebase.