You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Only the second one can be decoded as 'tomatoes' correctly. However, in function collate_fn, when batch_size=1, the format of trg is a list contained only one string, this will make edit_inner['labels'] wrong. I want to know if I understand it correctly.
The text was updated successfully, but these errors were encountered:
Thank you for bringing this to our attention! You are indeed correct in your understanding, and I have addressed the issue.
However, this bug does not affect the outcome, as the forward pass of MiniGPT-4 and BLIP-2 returns a class containing logits and labels. We utilize these labels for evaluation, not edit_inner['labels'].
MiniGPTOutput
Labels used in evaluation
My apologies for any inconvenience caused! Please don't hesitate to get in touch if you need any assistance.
I found something weird in VQADataset. When the tokenizer changes the answer to numbers, I found:
Only the second one can be decoded as 'tomatoes' correctly. However, in function collate_fn, when batch_size=1, the format of trg is a list contained only one string, this will make edit_inner['labels'] wrong. I want to know if I understand it correctly.
The text was updated successfully, but these errors were encountered: