TRT tries both sparse and dense tactics and chooses the faster one. In our experiments, sparse conv kernels are faster than dense conv kernels only when C and K are large enough (> 256). Could you try increasing C and K for the 1x1 conv and check whether the sparse tactic is then chosen?
A 3x3 conv effectively increases C by 9x, so it favors the sparse kernels.
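A quick back-of-envelope sketch of why the 3x3 convs clear the threshold while the 1x1 convs do not: under im2col lowering, a conv becomes a GEMM whose reduction dimension is C * kH * kW. The helper names and the ">256 on both dims" heuristic below are assumptions for illustration, not TRT's actual tactic-selection rule:

```python
# Sketch: estimate the GEMM reduction dimension that a conv lowers to via
# im2col. Per the comment above, 2:4 sparse tactics tend to win only when
# the relevant dimensions are large enough (> 256); the exact rule used by
# TRT's tactic selection is internal, so this is a rough heuristic.

def gemm_reduction_dim(in_channels: int, kernel_h: int, kernel_w: int) -> int:
    """im2col turns a conv into a GEMM with reduction dim C * kH * kW."""
    return in_channels * kernel_h * kernel_w

def likely_sparse_tactic(in_channels: int, out_channels: int,
                         kernel_h: int, kernel_w: int,
                         threshold: int = 256) -> bool:
    """Heuristic (assumption, not TRT's documented behavior): sparse conv
    kernels tend to be picked when both the reduction dim and the output
    channel count K exceed the threshold."""
    return (gemm_reduction_dim(in_channels, kernel_h, kernel_w) > threshold
            and out_channels > threshold)

# 1x1 conv with C=64: reduction dim is just 64, so the dense tactic wins.
print(likely_sparse_tactic(64, 64, 1, 1))    # False
# 3x3 conv with C=64: reduction dim is 64 * 9 = 576; with K=512 both
# dimensions clear the threshold, so the sparse tactic can win.
print(likely_sparse_tactic(64, 512, 3, 3))   # True
```

This is consistent with the observation in the question: 1x1 convs (and transformer Linear layers with modest feature sizes) keep a small reduction dimension, while a 3x3 conv multiplies it by 9.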
I used nsys to profile ResNet-34, and I found that only the 3x3 convs use 2:4 sparsity, while the 1x1 convs do not. (I also found that the Linear layers in transformers do not use sparsity.)
Why?