About the VQADataset #254

yaohui120 · 2024-05-11T16:55:57Z

I noticed something odd in VQADataset. When the tokenizer converts the answer to token ids, I get:

(Pdb) self.tok.encode([data[2]['target']], add_special_tokens=False, return_tensors="pt",)
tensor([[0]])
(Pdb) self.tok.encode(data[2]['target'], add_special_tokens=False, return_tensors="pt",)
tensor([[ 6454, 20452]])

Only the second call decodes back to 'tomatoes' correctly. However, in collate_fn, when batch_size=1, trg is a list containing a single string, which makes edit_inner['labels'] wrong. Is my understanding correct?
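A toy sketch of why the two calls above can differ. It rests on an assumption that this thread does not confirm: that the tokenizer treats a list of strings as already-split tokens and looks each one up in the vocabulary directly, so a whole word that is not a single vocabulary entry falls back to the unknown-token id. `TOY_VOCAB`, `toy_tokenize`, `toy_encode`, and `unwrap_target` are hypothetical names with made-up ids; none of them come from the repository or from the tokenizer library.

```python
from typing import List, Sequence, Union

# Made-up vocabulary for illustration; id 0 plays the role of the unknown token.
TOY_VOCAB = {"<unk>": 0, "tom": 1, "atoes": 2}


def toy_tokenize(text: str) -> List[str]:
    """Greedy longest-match split into known pieces (stand-in for subword tokenization)."""
    pieces: List[str] = []
    i = 0
    while i < len(text):
        for j in range(len(text), i, -1):
            if text[i:j] in TOY_VOCAB:
                pieces.append(text[i:j])
                i = j
                break
        else:
            # No known piece starts here: emit <unk> and skip one character.
            pieces.append("<unk>")
            i += 1
    return pieces


def toy_encode(text: Union[str, Sequence[str]]) -> List[int]:
    """A string is split into subwords; a list is assumed to be tokens already."""
    tokens = toy_tokenize(text) if isinstance(text, str) else list(text)
    return [TOY_VOCAB.get(tok, TOY_VOCAB["<unk>"]) for tok in tokens]


def unwrap_target(trg):
    """Hypothetical guard: turn a single-element list back into a plain string."""
    if isinstance(trg, (list, tuple)) and len(trg) == 1:
        return trg[0]
    return trg


ids_from_str = toy_encode("tomatoes")                     # split into subword pieces
ids_from_list = toy_encode(["tomatoes"])                  # looked up as one token
ids_guarded = toy_encode(unwrap_target(["tomatoes"]))     # guard restores the string path
```

Under that assumption, unwrapping the one-element list before encoding is one way to get the two-token result when batch_size=1. It is not necessarily how the issue was fixed in the repository.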


tbozhong · 2024-05-12T07:41:25Z

Thank you for bringing this to our attention! You are indeed correct in your understanding, and I have addressed the issue.

However, this bug does not affect the outcome, as the forward pass of MiniGPT-4 and BLIP-2 returns a class containing logits and labels. We utilize these labels for evaluation, not edit_inner['labels'].

MiniGPTOutput
Labels used in evaluation
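A minimal sketch of the idea described in this reply, assuming the model's forward pass returns an object that carries its own `labels` next to `logits`. `ToyOutput` and `labels_for_eval` are hypothetical stand-ins written for illustration; they are not the repository's classes or functions, and plain lists stand in for tensors.

```python
from dataclasses import dataclass
from typing import Any, List, Mapping, Optional


@dataclass
class ToyOutput:
    """Hypothetical stand-in for an output object holding logits and labels."""
    logits: List[List[float]]
    labels: Optional[List[int]] = None


def labels_for_eval(outputs: Any, batch: Mapping[str, Any]) -> List[int]:
    """Prefer labels attached to the model output; fall back to the batch labels."""
    labels = getattr(outputs, "labels", None)
    if labels is not None:
        return labels
    return batch["labels"]


# The batch carries the mis-tokenized labels; the output carries the correct ones.
batch = {"labels": [0]}
out = ToyOutput(logits=[[0.1, 0.9]], labels=[6454, 20452])
chosen = labels_for_eval(out, batch)
```

When evaluation reads the labels from the output object, a wrong value in the batch's own `labels` entry never reaches the metric, which matches the explanation above.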

My apologies for any inconvenience caused! Please don't hesitate to get in touch if you need any assistance.

yaohui120 · 2024-05-12T13:26:09Z

Oh, I was using an older version of the code, which does this:

post_edit_outputs = edited_model(batch["edit_outer"])
post_batch_labels = batch["edit_outer"]["labels"]
if not isinstance(post_edit_outputs, torch.Tensor):
    post_edit_logits = post_edit_outputs.logits
else:
    post_edit_logits = post_edit_outputs

I am testing the current version of the code now. Thank you.

yaohui120 · 2024-05-13T03:45:10Z

I tested the newest code on the VQA dataset, and it doesn't have the problem I mentioned above. Thank you~

zxlzr added the question Further information is requested label May 12, 2024

tbozhong closed this as completed in fd3bcce May 12, 2024

tbozhong reopened this May 12, 2024

yaohui120 closed this as completed May 13, 2024
