GeLU on AVX512 VL=256 uses AVX instruction #761

Open
alheinecke opened this issue May 9, 2023 · 0 comments
@alheinecke
Collaborator

The unary GeLU TPP uses `vblendvps` when generating code for AVX512 VL=256. This can lead to hard-to-debug problems: `vblendvps` is a VEX-encoded AVX instruction, so only the first 16 vector registers are valid operands for it, while AVX512 VL=256 code can otherwise use all 32. Either this restriction needs to be enforced, with the code generator erroring out when it cannot be met, or the corresponding EVEX instructions should be used.
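As a rough illustration of the "enforce and error out" option, the check could look something like the sketch below. It is not taken from the code base: `enc_kind`, `max_vreg`, and `vregs_encodable` are hypothetical names, and the sketch assumes 64-bit mode, where VEX can address vector registers 0 to 15 and EVEX can address 0 to 31.

```c
#include <stdbool.h>

/* Hypothetical encoding classes; illustrative names, not identifiers from the library. */
typedef enum { ENC_VEX, ENC_EVEX } enc_kind;

/* Highest vector register index an encoding can address in 64-bit mode:
 * VEX reaches registers 0..15, EVEX reaches registers 0..31. */
static unsigned int max_vreg(enc_kind enc) {
  return (enc == ENC_VEX) ? 15u : 31u;
}

/* Returns true only if every vector register operand is addressable
 * with the given encoding; a caller would error out on false. */
static bool vregs_encodable(enc_kind enc, const unsigned int* regs, unsigned int n_regs) {
  unsigned int i;
  for (i = 0; i < n_regs; ++i) {
    if (regs[i] > max_vreg(enc)) {
      return false;
    }
  }
  return true;
}
```

A generator that is about to emit a VEX-only instruction such as `vblendvps` would call `vregs_encodable(ENC_VEX, ...)` on its operands and report an error instead of emitting the instruction when the check fails.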
