Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

about your dataset #11

Open
DeathYmz opened this issue Jul 30, 2020 · 10 comments
Open

about your dataset #11

DeathYmz opened this issue Jul 30, 2020 · 10 comments

Comments

@DeathYmz
Copy link

Hello, can I ask how you generate validate_id.pickle/train_id.pickle/test_id.pickle?

@lt15523290043
Copy link

Hello, can I ask how you generate validate_id.pickle/train_id.pickle/test_id.pickle?
hello,Do you know how to generate the files now?

@DeathYmz
Copy link
Author

DeathYmz commented May 15, 2021 via email

@lt15523290043
Copy link

Thank you very much for your reply. Could you tell me the relevant codes of this file of w2v.pickle

@lt15523290043
Copy link

Hello, can I ask how you generate validate_id.pickle/train_id.pickle/test_id.pickle?

Thank you very much for your reply,Could you tell me the relevant codes of this file of w2v.pickle ?It's very important to me. I'm a beginner. I'm sorry to bother you

@DeathYmz
Copy link
Author

DeathYmz commented May 16, 2021 via email

@lt15523290043
Copy link

The code on git is pretty good, you can take a closer look and you can understand the usefulness of each def get_data(text_only):     #text_only = False     if text_only:         print("Text only")         image_list = []     else:         print("Text and image")         image_list = read_image()     train_data = write_data("train", image_list, text_only)     valiate_data = write_data("validate", image_list, text_only)     test_data = write_data("test", image_list, text_only)     print("loading data...")     # w2v_file = '../Data/GoogleNews-vectors-negative300.bin'     vocab, all_text = load_data(train_data, valiate_data, test_data)     # print(str(len(all_text)))     print("number of sentences: " + str(len(all_text)))     print("vocab size: " + str(len(vocab)))     max_l = len(max(all_text, key=len))     print("max sentence length: " + str(max_l))     word_embedding_path = "../EANN-KDD18-master/Data/weibo/w2v.pickle"     w2v = pickle.load(open(word_embedding_path,"rb"),encoding='bytes')     # print(w2v)     # input("w2v over")     print("word2vec loaded!")     print("num words already in word2vec: " + str(len(w2v)))          add_unknown_words(w2v, vocab)     W, word_idx_map = get_W(w2v)     # # rand_vecs = {}     # # add_unknown_words(rand_vecs, vocab)     W2 = rand_vecs = {}     w_file = open("../EANN-KDD18-master/Data/weibo/word_embedding.pickle", "wb")     pickle.dump([W, W2, word_idx_map, vocab, max_l], w_file)     w_file.close()     return train_data, valiate_data, test_data

------------------ 原始邮件 ------------------ 发件人: @.>; 发送时间: 2021年5月16日(星期天) 上午10:08 收件人: @.>; 抄送: @.>; @.>; 主题: Re: [yaqingwang/EANN-KDD18] about your dataset (#11) Hello, can I ask how you generate validate_id.pickle/train_id.pickle/test_id.pickle? Thank you very much for your reply,Could you tell me the relevant codes of this file of w2v.pickle ?It's very important to me. I'm a beginner. I'm sorry to bother you — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or unsubscribe.

I used Weibo files and it worked ,but I don't know how to use Twitter data.Please tell me whether you useTwitter files for your experiment?

@youran521
Copy link

Hello, can I ask how you generate validate_id.pickle/train_id.pickle/test_id.pickle?Thank you!

@Dxy-cpu
Copy link

Dxy-cpu commented Jan 23, 2022

Hello, can I ask how you generate validate_id.pickle/train_id.pickle/test_id.pickle?

Thank you very much for your reply,Could you tell me the relevant codes of this file of w2v.pickle ?It's very important to me. I'm a beginner. I'm sorry to bother you

hello,I have the same problem.could you please tell me how should I do?

@Dxy-cpu
Copy link

Dxy-cpu commented Jan 23, 2022

my email adress is ding_xinyang0606@163.com

@balabalacc
Copy link

I have the same problem. Can you tell me how to solve it?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants