
GradCAM for Dual Attention ViT #474

Open
tayyabapucit opened this issue Dec 26, 2023 · 1 comment

Comments

@tayyabapucit

How can I use Grad-CAM with the Dual Attention ViT (DaViT) transformer (davit_tiny.msft_in1k)?
I tried a reshape transform with height/width of 7×7 and 14×14, but I still need to figure out which layer to target and how to reshape its output.

I've successfully used GradCAM, EigenCAM, and ScoreCAM with ResNet, DenseNet, VGG, ViT, and Swin Transformer models, with excellent results in every case.

Any help?

@tayyabapucit
Author

OK, I've figured it out. It worked with:
target_layers = [list(model.backbone.modules())[-13]]

and

def reshape_transform2(tensor, height=7, width=7):
    # Fold the (batch, tokens, channels) tensor into
    # (batch, height, width, channels).
    result = tensor.reshape(tensor.size(0),
                            height, width, tensor.size(2))

    # Bring the channels to the first dimension,
    # like in CNNs.
    result = result.transpose(2, 3).transpose(1, 2)

    return result

Am I doing it right?
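For anyone landing here, the reshaping step above can be sanity-checked in isolation. The sketch below mirrors the author's reshape_transform2 and verifies that a batch of flattened tokens folds back into a CNN-style (batch, channels, height, width) map; the run_gradcam function is a hypothetical end-to-end wiring (the layer index -13 follows the author's snippet and may differ across timm versions or model wrappers, and the function assumes pytorch-grad-cam and timm are installed):

```python
import torch


def reshape_transform(tensor, height=7, width=7):
    """Fold a (batch, height*width, channels) token tensor
    into a (batch, channels, height, width) feature map."""
    result = tensor.reshape(tensor.size(0),
                            height, width, tensor.size(2))
    # Move channels first, like a CNN activation map.
    return result.transpose(2, 3).transpose(1, 2)


def run_gradcam(input_tensor, rgb_image):
    """Hypothetical usage sketch; requires timm and pytorch-grad-cam."""
    import timm
    from pytorch_grad_cam import GradCAM
    from pytorch_grad_cam.utils.image import show_cam_on_image

    model = timm.create_model("davit_tiny.msft_in1k", pretrained=True).eval()
    # Layer index taken from the comment above; verify it for your
    # timm version by inspecting model.named_modules().
    target_layers = [list(model.modules())[-13]]
    cam = GradCAM(model=model, target_layers=target_layers,
                  reshape_transform=reshape_transform)
    grayscale_cam = cam(input_tensor=input_tensor)[0]
    return show_cam_on_image(rgb_image, grayscale_cam, use_rgb=True)


# Sanity check: 49 tokens of width 768 fold into a 7x7 map.
tokens = torch.randn(2, 49, 768)
print(tuple(reshape_transform(tokens).shape))  # (2, 768, 7, 7)
```

If the reshape raises a size-mismatch error, the target layer is emitting a different token count than height × width, which is the quickest way to tell you picked the wrong stage.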
