How did you generate the input data files like data.pkl, word2id and word_embedding.pkl ? #29

yangliuy · 2019-03-05T01:27:25Z

Firstly thanks for the great ACL paper and open source code!

I have a question on the data preprocessing part. How did you generate the input data files like data.pkl, word2id,vocab.txt and word_embedding.pkl ? Let's take UDC as the example. The raw data only contains train.txt/valid.txt/test.txt. I checked your code and there are no scripts on generating these files like data.pkl and word_embedding.pkl. Could you also upload these data preprocessing scripts ?

xyzhou-puck · 2019-03-05T02:20:53Z

Hi,

We got those data by hacking the source code of SMN, to make sure that our experimental data sets are the same.

Xiangyang

yangliuy · 2019-03-05T04:20:12Z

Hi Xiangyang,

Thank you for your reply! I found a similar question here #5 . I will check the preprocessing code of SMN.

xyzhou-puck · 2019-03-05T08:21:21Z

You are welcome.

MASTERPlECE · 2019-04-16T13:03:36Z

Hi! Do you know how to deal with .w2v file? How to transfer it to word_embedding.pkl?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How did you generate the input data files like data.pkl, word2id and word_embedding.pkl ? #29

How did you generate the input data files like data.pkl, word2id and word_embedding.pkl ? #29

yangliuy commented Mar 5, 2019

xyzhou-puck commented Mar 5, 2019

yangliuy commented Mar 5, 2019

xyzhou-puck commented Mar 5, 2019

MASTERPlECE commented Apr 16, 2019

How did you generate the input data files like data.pkl, word2id and word_embedding.pkl ? #29

How did you generate the input data files like data.pkl, word2id and word_embedding.pkl ? #29

Comments

yangliuy commented Mar 5, 2019

xyzhou-puck commented Mar 5, 2019

yangliuy commented Mar 5, 2019

xyzhou-puck commented Mar 5, 2019

MASTERPlECE commented Apr 16, 2019