-
Notifications
You must be signed in to change notification settings - Fork 236
Pull requests: NVIDIA/TransformerEngine
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add documentation for dot product attention
#889
opened Jun 4, 2024 by
cyanguwa
Loading…
2 of 4 tasks
Get CMake bin dir from CMake module if possible
bug
Something isn't working
build
Build system
#888
opened Jun 4, 2024 by
timmoon10
Loading…
3 of 11 tasks
Use unoptimized RMSNorm kernel if pointers are not aligned
bug
Something isn't working
#886
opened Jun 3, 2024 by
timmoon10
Loading…
4 of 11 tasks
[PyTorch] Add support for cuDNN FusedAttention + THD + CP
#885
opened Jun 3, 2024 by
xrennvidia
Loading…
6 tasks
[JAX] Splitting CPP extensions by category
#883
opened Jun 1, 2024 by
phu0ngng
Loading…
5 of 11 tasks
[JAX] Added unit tests for distributed LayernormMLP
#878
opened May 29, 2024 by
phu0ngng
Loading…
4 of 9 tasks
[PyTorch] Avoid select op in PyTorch extensions
enhancement
New feature or request
#865
opened May 24, 2024 by
timmoon10
Loading…
6 of 11 tasks
[C/PyTorch/JAX] Build system improvements for rpath and C++11 ABI
build
Build system
enhancement
New feature or request
#858
opened May 20, 2024 by
denera
Loading…
9 of 11 tasks
[Common/PyTorch] Grouped GEMM via multi-stream cuBLAS
#853
opened May 17, 2024 by
yaox12
Loading…
8 of 11 tasks
[JAX] Rewrite the Format of FP8 Meta and Remove unused ShardingTypes.
#842
opened May 13, 2024 by
mingxu1067
Loading…
8 of 11 tasks
[Pytorch] Implement fp32 accumulation for attention with context parallel in both forward and backward pass.
#821
opened Apr 28, 2024 by
Yuxin-CV
Loading…
[PyTorch] Fix minor bug in computing num_gqa_groups_per_partition
bug
Something isn't working
#777
opened Apr 13, 2024 by
knowlsie
Loading…
[C/PyTorch] Refactor and move userbuffers into TE/common
#760
opened Apr 8, 2024 by
denera
Loading…
6 of 13 tasks
[PyTorch] Sequential fuser
enhancement
New feature or request
#707
opened Mar 9, 2024 by
timmoon10
Loading…
2 of 6 tasks
[PyTorch] Distributed intermediate/activation tensors for FSDP
#687
opened Feb 28, 2024 by
denera
Loading…
Remove now useless padding as it is now down automatically.
jax
#680
opened Feb 25, 2024 by
nouiz
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.