Developed methods to analyze patch importance in Vision Transformers (ViTs) by extracting the [CLS] token's attention scores from the attention matrices of the multi-head self-attention (MHSA) blocks. Visualized the distribution of the top-k patch tokens most attended to by the [CLS] token, highlighting the image regions that contribute most to model predictions.
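A minimal sketch of the top-k extraction step, assuming an attention tensor of shape (batch, num_heads, num_tokens, num_tokens) with the [CLS] token at index 0, as in standard ViT implementations; the function name, k value, and 14x14 patch grid are illustrative, not the original implementation.

```python
import torch

def topk_cls_patches(attn: torch.Tensor, k: int = 10) -> torch.Tensor:
    """Return indices of the k patch tokens the [CLS] token attends to most.

    `attn` is assumed to be one MHSA block's attention tensor of shape
    (batch, num_heads, num_tokens, num_tokens), with [CLS] at index 0.
    """
    # Average attention over heads: (batch, num_tokens, num_tokens)
    attn_mean = attn.mean(dim=1)
    # Row 0 is the [CLS] query; drop column 0 so only patch tokens remain.
    cls_to_patches = attn_mean[:, 0, 1:]           # (batch, num_patches)
    # Indices of the k patches receiving the highest [CLS] attention.
    return cls_to_patches.topk(k, dim=-1).indices  # (batch, k)

# Example with a random attention map for a 14x14-patch ViT (196 patches + [CLS] = 197 tokens).
if __name__ == "__main__":
    dummy_attn = torch.softmax(torch.randn(1, 12, 197, 197), dim=-1)
    idx = topk_cls_patches(dummy_attn, k=5)
    rows, cols = idx // 14, idx % 14  # map flat patch indices back to the 14x14 grid
    print(list(zip(rows[0].tolist(), cols[0].tolist())))
```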
Additionally, implemented Attention Rollout to recursively propagate attention scores across layers, producing interpretable visualizations of how information flows through the self-attention stack and clarifying which input patches drive the model's decisions.
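A sketch of Attention Rollout under the same shape assumptions as above: each layer's head-averaged attention is combined with the identity (to account for residual connections), row-normalized, and multiplied into a running product; the [CLS] row of the result gives per-patch relevance. Function names and the random test tensors are illustrative.

```python
import torch

def attention_rollout(attentions: list[torch.Tensor]) -> torch.Tensor:
    """Recursively propagate attention across layers (Attention Rollout).

    `attentions` is assumed to be a list of per-layer attention tensors, each of
    shape (batch, num_heads, num_tokens, num_tokens), ordered from the first to
    the last transformer block, with [CLS] at token index 0.
    """
    rollout = None
    for attn in attentions:
        # Average over heads, then add the identity to model the residual connection.
        a = attn.mean(dim=1)
        a = a + torch.eye(a.size(-1), device=a.device)
        # Re-normalize rows so each remains a distribution over tokens.
        a = a / a.sum(dim=-1, keepdim=True)
        # Multiply into the running product: rollout_l = A_l @ rollout_{l-1}.
        rollout = a if rollout is None else a @ rollout
    # Row 0 shows how much the final [CLS] representation draws from each patch token.
    return rollout[:, 0, 1:]  # (batch, num_patches)

# Usage sketch with random per-layer maps for a 12-layer, 197-token ViT; in practice the
# per-layer attentions would come from the model (e.g., output_attentions=True in HuggingFace ViT).
if __name__ == "__main__":
    layers = [torch.softmax(torch.randn(1, 12, 197, 197), dim=-1) for _ in range(12)]
    scores = attention_rollout(layers)
    heatmap = scores.reshape(1, 14, 14)  # reshape to the 14x14 patch grid for visualization
    print(heatmap.shape)
```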