You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi! Thank you for the amazing work with including registers and including their checkpoints. I was trying to reproduce the results from Table 3 in "Vision Transformers Need Registers" paper: LOST unsupervised object discovery using the features of the ViT. For some reason, I'm unable to reproduce the number for DINOv2+reg on any of the datasets. We get ~35.94 for the VOC12 dataset, and ~23.39 for the COCO dataset using the official LOST implementation and the official checkpoints of DINOv2+reg (from this github codebase).
We suspect it may be due to the distillation process; is there some way that the authors can confirm this is the case? Can they possibly share the evaluation setting for the results on LOST object discovery?
Many thanks!
The text was updated successfully, but these errors were encountered:
Hi! Thank you for the amazing work with including registers and including their checkpoints. I was trying to reproduce the results from Table 3 in "Vision Transformers Need Registers" paper: LOST unsupervised object discovery using the features of the ViT. For some reason, I'm unable to reproduce the number for DINOv2+reg on any of the datasets. We get ~35.94 for the VOC12 dataset, and ~23.39 for the COCO dataset using the official LOST implementation and the official checkpoints of DINOv2+reg (from this github codebase).
We suspect it may be due to the distillation process; is there some way that the authors can confirm this is the case? Can they possibly share the evaluation setting for the results on LOST object discovery?
Many thanks!
The text was updated successfully, but these errors were encountered: