Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] Update MultiWOZ dataset from 2.1 to 2.2 #55

Open
derekchen14 opened this issue Jul 16, 2020 · 11 comments
Open

[Feature] Update MultiWOZ dataset from 2.1 to 2.2 #55

derekchen14 opened this issue Jul 16, 2020 · 11 comments

Comments

@derekchen14
Copy link

Given the release of MultiWoZ 2.2, it seems like the baselines should all be retrained using the cleanest version of the dataset. Paper: https://www.aclweb.org/anthology/2020.nlp4convai-1.13/

@zqwerty
Copy link
Member

zqwerty commented Jul 17, 2020

Thanks! We've noticed MultiWOZ 2.2. We will add it if it is of high quality

@chris-boson
Copy link

chris-boson commented Jan 2, 2021

Also would be great to support the new format (which will also make it easy to add SGD).

@zqwerty
Copy link
Member

zqwerty commented Jan 8, 2021

We are planning to add many datasets (SchemaGuided, Taskmaster, etc.) using a unified format.

@tomolopolis
Copy link

great that you're planning to add SGD and Taskmaster, any updates on when that will be available

@zqwerty
Copy link
Member

zqwerty commented Mar 10, 2021

Actually, we have processed SGD, Taskmaster, and other datasets. We will update them with MultiWOZ 2.2 & 2.3 in few days. Thanks!

@tomolopolis
Copy link

great stuff - looking forward to it!

@zqwerty
Copy link
Member

zqwerty commented Mar 10, 2021

@tomolopolis SGD and Taskmaster are available in unified format #180.

@tomolopolis
Copy link

@zqwerty thanks for that, are there plans to replicate (some) of the existing supported model implementations to use the unified format? then have the various datasets configurable in each model, given the consistent format?

For example some new modules might be:
convlab2/nlu/jointBERT/unified/nlu.py
convlab2/dst/comer/unified/dst.py
convlab2/policy/gdpl/unified/policy.py
convlab2/nlg/sclstm/unified/nlg.py
...

@zqwerty
Copy link
Member

zqwerty commented Mar 13, 2021

@tomolopolis we will modify the unified data process and support some of the useful models. However, some models have a lot of dataset-specific processes which can not be well unified.

@zqwerty
Copy link
Member

zqwerty commented Mar 14, 2021

@tomolopolis we have added multiwoz 2.2 and multiwoz-coref. Check 34960ff in master. However, I deleted the previous commit in order to remove git lfs due to the limited bandwidth for download.
I've noticed that you have merged the previous pull-request. Hope that will not bother you too much.

@tomolopolis
Copy link

@zqwerty Thanks for adding those. No worries about deleting the previous commit, I can pull in the latest

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants