Skip to content

Commit

Permalink
update XStoryCloze dataset description (#5100)
Browse files Browse the repository at this point in the history
  • Loading branch information
todpole3 committed May 8, 2023
1 parent b35e8ef commit 5ecbbf5
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion examples/xglm/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -140,7 +140,7 @@ for lang in ['en', 'zh', 'hi']:

## XStoryCloze

We release XStoryCloze, a new multilingual dataset intended for few-shot evaluation, alongside this paper. XStoryCloze consists of professional translation of the [English StoryCloze dataset](https://cs.rochester.edu/nlp/rocstories/) (Spring 2016 version) to 10 other languages. It is opensourced under [CC BY-SA 4.0](https://creativecommons.org/licenses/by-sa/4.0/legalcode), the same license as the English StoryCloze.
We release XStoryCloze, a new multilingual dataset intended for few-shot evaluation, alongside this paper. XStoryCloze consists of professional translation of the validation split of the [English StoryCloze dataset](https://cs.rochester.edu/nlp/rocstories/) (Spring 2016 version) to 10 other languages. It is opensourced under [CC BY-SA 4.0](https://creativecommons.org/licenses/by-sa/4.0/legalcode), the same license as the English StoryCloze.

You can download the dataset via [this link](https://dl.fbaipublicfiles.com/xstorycloze.zip).

Expand Down
2 changes: 1 addition & 1 deletion examples/xglm/XStoryCloze.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
XStoryCloze consists of professional translation of the [English StoryCloze dataset](https://cs.rochester.edu/nlp/rocstories/) (Spring 2016 version) to 10 other languages. This dataset is released by Meta AI alongside the paper [Few-shot Learning with Multilingual Generative Language Models. EMNLP 2022](https://arxiv.org/abs/2112.10668).
XStoryCloze consists of professional translation of the validation split of the [English StoryCloze dataset](https://cs.rochester.edu/nlp/rocstories/) (Spring 2016 version) to 10 other languages. This dataset is released by Meta AI alongside the paper [Few-shot Learning with Multilingual Generative Language Models. EMNLP 2022](https://arxiv.org/abs/2112.10668).

# Languages
ru, zh (Simplified), es (Latin America), ar, hi, id, te, sw, eu, my.
Expand Down

0 comments on commit 5ecbbf5

Please sign in to comment.