Skip to content

Pull requests: NVIDIA/TransformerEngine

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Add documentation for dot product attention
#889 opened Jun 4, 2024 by cyanguwa Loading…
2 of 4 tasks
Get CMake bin dir from CMake module if possible bug Something isn't working build Build system
#888 opened Jun 4, 2024 by timmoon10 Loading…
3 of 11 tasks
Use unoptimized RMSNorm kernel if pointers are not aligned bug Something isn't working
#886 opened Jun 3, 2024 by timmoon10 Loading…
4 of 11 tasks
[PyTorch] Add support for cuDNN FusedAttention + THD + CP
#885 opened Jun 3, 2024 by xrennvidia Loading…
6 tasks
[JAX] Splitting CPP extensions by category
#883 opened Jun 1, 2024 by phu0ngng Loading…
5 of 11 tasks
Fp8 model init factory
#880 opened May 30, 2024 by sudhakarsingh27 Draft
[JAX] Added unit tests for distributed LayernormMLP
#878 opened May 29, 2024 by phu0ngng Loading…
4 of 9 tasks
Build system refactor for wheels
#877 opened May 29, 2024 by ksivaman Loading…
8 of 11 tasks
[PyTorch] Avoid select op in PyTorch extensions enhancement New feature or request
#865 opened May 24, 2024 by timmoon10 Loading…
6 of 11 tasks
Avoid framework specific import from top level enhancement New feature or request
#862 opened May 22, 2024 by ksivaman Draft
6 of 11 tasks
[C/PyTorch/JAX] Build system improvements for rpath and C++11 ABI build Build system enhancement New feature or request
#858 opened May 20, 2024 by denera Loading…
9 of 11 tasks
[Common/PyTorch] Grouped GEMM via multi-stream cuBLAS
#853 opened May 17, 2024 by yaox12 Loading…
8 of 11 tasks
Generation tutorial for Gemma model
#829 opened May 1, 2024 by pggPL Loading…
8 of 11 tasks
[UB] Adding support for multinode nvlink
#815 opened Apr 26, 2024 by shamisp Loading…
Bug fix in DGRAD->RS overlap
#802 opened Apr 23, 2024 by vasunvidia Draft
[PyTorch] Fix minor bug in computing num_gqa_groups_per_partition bug Something isn't working
#777 opened Apr 13, 2024 by knowlsie Loading…
[C/PyTorch] Refactor and move userbuffers into TE/common
#760 opened Apr 8, 2024 by denera Loading…
6 of 13 tasks
Fix bhss bias format before sm90
#736 opened Mar 27, 2024 by zlsh80826 Loading…
[PyTorch] Sequential fuser enhancement New feature or request
#707 opened Mar 9, 2024 by timmoon10 Loading…
2 of 6 tasks
ProTip! Follow long discussions with comments:>50.