Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[2.0.1] Disable SDPA FlashAttention backward and mem eff attention on…
… sm86+ for head_dim above 64 (#99736) * Disable SDPA FlashAttention backward and mem eff attention on sm86+ for head_dim above 64 (#99105) Expand sdpa_utils.h check to disable FlashAttention when using autograd and mem eff attention for the following cases - head_dim > 64 - sm86 or newer Previously we only disable these kernels on sm86 and for head_dim equal to 128. Pull Request resolved: #99105 Approved by: https://github.com/malfet * remove master only test --------- Co-authored-by: albanD <desmaison.alban@gmail.com>
- Loading branch information
Showing
2 changed files
with
63 additions
and
16 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters