Question or bug in blip_pretrain.py #207

Open
LiGuo12 opened this issue Apr 13, 2024 · 0 comments
LiGuo12 commented Apr 13, 2024

Between lines 54-75, after "self.text_encoder = BertModel.from_pretrained('bert-base-uncased', config=encoder_config, add_pooling_layer=False)", where "encoder_config" is loaded from 'configs/bert_config.json', the vocab_size is 31090. Then, after "self.text_encoder.resize_token_embeddings(len(self.tokenizer))", the vocab_size changes to 31092. However, "self.text_encoder_m" is never resized, so its vocab_size stays at 31090. As a result, "self.text_encoder" and "self.text_encoder_m" have mismatched embedding shapes when "self.copy_params()" copies their parameters.
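
A minimal, self-contained sketch of the mismatch (an assumption for illustration: it uses the public 'bert-base-uncased' checkpoint instead of the repository's configs/bert_config.json, so the absolute vocab sizes differ from the 31090/31092 reported above):

```python
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')
# BLIP adds extra special tokens, which grows the tokenizer beyond the base vocab.
tokenizer.add_special_tokens({'additional_special_tokens': ['[DEC]', '[ENC]']})

text_encoder = BertModel.from_pretrained('bert-base-uncased', add_pooling_layer=False)
text_encoder_m = BertModel.from_pretrained('bert-base-uncased', add_pooling_layer=False)

# Only the online encoder is resized, mirroring blip_pretrain.py.
text_encoder.resize_token_embeddings(len(tokenizer))

# copy_params() copies parameters pairwise; the embedding tables now disagree.
for p, p_m in zip(text_encoder.parameters(), text_encoder_m.parameters()):
    if p.shape != p_m.shape:
        print('shape mismatch:', tuple(p.shape), 'vs', tuple(p_m.shape))
        # p_m.data.copy_(p.data) would raise a RuntimeError here.
```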

Is this a bug? I think there should be a "self.text_encoder_m.resize_token_embeddings(len(self.tokenizer))" after line 67, as sketched below.
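
A sketch of the proposed fix, in the same standalone setup as above (again using 'bert-base-uncased' as an assumption rather than the repository's config): once both encoders are resized with the same tokenizer length, the pairwise copy that copy_params() performs goes through.

```python
import torch
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')
tokenizer.add_special_tokens({'additional_special_tokens': ['[DEC]', '[ENC]']})

text_encoder = BertModel.from_pretrained('bert-base-uncased', add_pooling_layer=False)
text_encoder_m = BertModel.from_pretrained('bert-base-uncased', add_pooling_layer=False)

# Proposed change: resize the momentum encoder with the same tokenizer length.
text_encoder.resize_token_embeddings(len(tokenizer))
text_encoder_m.resize_token_embeddings(len(tokenizer))

# The pairwise copy done by copy_params() now succeeds for every tensor.
with torch.no_grad():
    for p, p_m in zip(text_encoder.parameters(), text_encoder_m.parameters()):
        p_m.data.copy_(p.data)
        p_m.requires_grad = False
```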
