
vision-transformer problem report #1184

Open
ChenDaiwei-99 opened this issue Aug 24, 2023 · 4 comments

Comments

@ChenDaiwei-99

[Screenshots of the reported problem in the vision-transformer example]

@msaroufim
Member

If you want to contribute some fixes, I'd be happy to merge them.

@ChenDaiwei-99
Author

Sure, will do it in a few days :)

@colinosterman

I can confirm this example is badly broken. I added some code to compare individual labels to predictions and discovered that the forward pass of the ViT always returns the same tensor, regardless of the input. The tensor it returns is different each time I run the script, even if I load the same weights from the save file and do no training. It's no wonder the loss can't get below about 2.3: always giving the same prediction hits the correct label about as often as random guessing.
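A minimal sketch of the check described above, assuming a standard PyTorch model that takes 224x224 RGB batches; the function name and input shape are illustrative, not part of the example's code:

```python
import torch

def returns_constant_output(model, input_shape=(1, 3, 224, 224), atol=1e-6):
    """Feed two unrelated random inputs and report whether the outputs match.

    If the forward pass ignores its input, the two outputs will be
    (nearly) identical and this returns True.
    """
    model.eval()
    with torch.no_grad():
        out1 = model(torch.randn(*input_shape))
        out2 = model(torch.randn(*input_shape))
    return torch.allclose(out1, out2, atol=atol)

# Usage (assuming `vit` is the model built by the example script):
# print(returns_constant_output(vit))
```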

@fanqieguo

I added an accuracy printout to the code, but during training the accuracy does not improve and the loss does not converge, even after many epochs. Does this model really work? I think it has serious problems. I hope to get an answer; this is very important to me.
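A minimal sketch of such an accuracy printout inside a standard PyTorch training loop; the `model`, `loader`, `criterion`, and `optimizer` names are placeholders for whatever the example script constructs, not its actual variables:

```python
import torch

def train_one_epoch(model, loader, criterion, optimizer, device):
    """Train for one epoch and print the running loss and accuracy."""
    model.train()
    correct, total, running_loss = 0, 0, 0.0
    for images, labels in loader:
        images, labels = images.to(device), labels.to(device)

        optimizer.zero_grad()
        logits = model(images)
        loss = criterion(logits, labels)
        loss.backward()
        optimizer.step()

        # Accumulate loss and top-1 accuracy over the epoch.
        running_loss += loss.item() * labels.size(0)
        correct += (logits.argmax(dim=1) == labels).sum().item()
        total += labels.size(0)

    print(f"loss: {running_loss / total:.4f}  accuracy: {correct / total:.4f}")
```

If the model's forward pass really ignores its input, this accuracy should hover around chance level no matter how many epochs are run.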
