problem in dataloader.py #13
Comments
How did you obtain the dataset? Did you crawl it based on data.json? @DimPenCHEN
I obtained the data from a public dataset hosted on huggingface.co. I can send you the exact address later (I can't find it right now).
@pp-jia https://huggingface.co/datasets/MischaQI/FakeSV/tree/main
Thanks, you can contact me via the email address on my homepage. @DimPenCHEN
The data.json file does not contain the 'label' column, so map the 'annotation' column to integers and filter on it instead:
# 辟谣 = debunk, 假 = fake, 真 = real
replace_values = {'辟谣': 2, '假': 1, '真': 0}
self.data_complete['annotation'] = self.data_complete['annotation'].replace(replace_values)
self.data_complete = self.data_complete[self.data_complete['annotation'] != 2]
#self.data_complete = self.data_complete[self.data_complete['label']!=2] # label: 0-real, 1-fake, 2-debunk
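For anyone who wants to try the mapping outside of dataloader.py, here is a minimal standalone sketch. The rows and the video_id column are made up for illustration; only the 'annotation' key and the three values come from this thread. It uses Series.map rather than replace, which should give the same result here as long as every row holds one of the three known values.

```python
import pandas as pd

# Hypothetical rows standing in for data.json; only the 'annotation' key
# and its three values are taken from this thread.
data_complete = pd.DataFrame(
    {
        "video_id": ["a", "b", "c", "d"],
        "annotation": ["真", "假", "辟谣", "真"],
    }
)

# Mapping from the thread: 0 = real, 1 = fake, 2 = debunk.
replace_values = {"辟谣": 2, "假": 1, "真": 0}

# Turn each annotation string into its integer code.
data_complete["annotation"] = data_complete["annotation"].map(replace_values)

# Drop the debunk rows, keeping only real (0) and fake (1).
data_complete = data_complete[data_complete["annotation"] != 2]

print(data_complete["annotation"].tolist())
```

Note that map leaves NaN for any value missing from the dictionary, so it may be worth checking for unexpected annotation values before training.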
If you follow the above, be sure to change label = 0 if item['annotation'] == '真' else 1 to label = 0 if item['annotation'] == 0 else 1. The annotation values are integers after the replacement, so the string comparison never matches. Otherwise, there will be a significant error during the training phase.
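A small sketch of why the comparison has to change, assuming the annotation values have already been converted to integers. The items list and the two helper functions are hypothetical and only mirror the two expressions quoted in this thread.

```python
# Hypothetical items as they look after 'annotation' was mapped to integers
# (0 = real, 1 = fake).
items = [{"annotation": 0}, {"annotation": 1}, {"annotation": 0}]


def label_from_string(item):
    # Original check: compares against the string '真', which an integer
    # annotation never equals.
    return 0 if item["annotation"] == "真" else 1


def label_from_int(item):
    # Updated check: compares against the integer code for "real".
    return 0 if item["annotation"] == 0 else 1


old_labels = [label_from_string(i) for i in items]
new_labels = [label_from_int(i) for i in items]

print(old_labels)  # every item is labeled 1 (fake)
print(new_labels)  # labels follow the integer annotation
```

With the string comparison left in place, every sample ends up labeled as fake, which is the kind of training error described above.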
I have solved the data problems (I think), but I found that "dataloader.py" uses the "label" key from data.json, which is not in my data.json:
self.data_complete = self.data_complete[self.data_complete['label']!=2] # label: 0-real, 1-fake, 2-debunk
BTW, I haven't obtained the data directories "vids" and "ptvgg19_frame_thumb", which aren't mentioned in the public dataset.
I wonder how to solve this problem in dataloader.py TwT