PyTorch 2.0.1 Release, bug fix release

This release is meant to fix the following issues (regressions / silent correctness):

Fix _canonical_mask throws warning when bool masks passed as input to TransformerEncoder/TransformerDecoder (#96009, #96286)
Fix EmbeddingBag with max_norm=-1 raising "leaf Variable that requires grad is being used in an in-place operation" #95980
Fix type hint for torch.Tensor.grad_fn, which can be a torch.autograd.graph.Node or None. #96804
Can’t convert float to int when the input is a scalar np.ndarray. #97696
Revisit torch._six.string_classes removal #97863
Fix module backward pre-hooks to actually update gradient #97983
Fix load_sharded_optimizer_state_dict error on multi node #98063
Warn once for TypedStorage deprecation #98777
cuDNN V8 API, Fix incorrect use of emplace in the benchmark cache #97838

Update Multi-Head Attention's doc string #97046
Fix incorrect behavior of is_causal parameter for torch.nn.TransformerEncoderLayer.forward #97214
Fix error for SDPA on sm86 and sm89 hardware #99105
Fix nn.MultiheadAttention mask handling #98375
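
Several of the attention fixes above concern how masks are passed to the Transformer modules. As a rough illustration, the sketch below passes boolean masks to `nn.TransformerEncoder`, where `True` marks a position that must not be attended to. The tensor sizes and variable names are arbitrary choices for this example, and it assumes a PyTorch 2.x install.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Small encoder; dropout is disabled so the run is deterministic.
layer = nn.TransformerEncoderLayer(
    d_model=16, nhead=4, dim_feedforward=32, dropout=0.0, batch_first=True
)
encoder = nn.TransformerEncoder(layer, num_layers=2)

x = torch.randn(2, 5, 16)  # (batch, sequence, features)

# Boolean padding mask: True marks padded positions to ignore.
padding_mask = torch.tensor(
    [[False, False, False, True, True],
     [False, False, False, False, False]]
)

# Boolean causal mask: True above the diagonal blocks attention to the future.
causal_mask = torch.triu(torch.ones(5, 5, dtype=torch.bool), diagonal=1)

out = encoder(x, mask=causal_mask, src_key_padding_mask=padding_mask)
print(out.shape)
```

Using the same dtype (here bool) for both masks avoids mixing mask conventions within a single call.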

Fix regression for pin_memory recursion when operating on bytes #97737
Fix collation logic #97789
Fix potentially backwards incompatible change with DataLoader and is_shardable Datapipes #97287
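
The pin_memory item above (#97737) involves recursion over nested batch data that contains bytes. The snippet below is a generic sketch of why recursive container traversals usually treat `str` and `bytes` as leaves; it is not PyTorch's code, and `apply_recursive` is a hypothetical helper. Both types are sequences in Python, so a traversal that lacks this check would walk them element by element (and for `str`, each element is itself a `str`).

```python
from collections.abc import Mapping, Sequence


def apply_recursive(data, fn):
    """Apply fn to every leaf of a nested container (hypothetical helper)."""
    # str and bytes are Sequences too; return them untouched so the
    # traversal does not descend into individual characters or byte values.
    if isinstance(data, (str, bytes)):
        return data
    if isinstance(data, Mapping):
        return {key: apply_recursive(value, fn) for key, value in data.items()}
    if isinstance(data, Sequence):
        # Rebuild the same container type (works for list and tuple).
        return type(data)(apply_recursive(value, fn) for value in data)
    return fn(data)


batch = {"ids": [1, 2, 3], "raw": b"\x00\x01", "name": "sample"}
result = apply_recursive(batch, lambda value: value * 2)
print(result)
```

The order of the checks matters: the `str`/`bytes` test has to come before the generic `Sequence` branch.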

Fix Convolutions for CUDA-11.8 wheel builds #99451
Fix Import torchaudio + torch.compile crashes on exit #96231
Linux aarch64 wheels are missing the mkldnn+acl backend support - pytorch/builder@54931c2
Linux aarch64 torchtext 0.15.1 wheels are missing for aarch64_linux platform - pytorch/builder#1375
Enable ROCm 5.4.2 manywheel and python 3.11 builds #99552
PyTorch cannot be installed at the same time as numpy in a conda env on osx-64 / Python 3.11 #97031
Illegal instruction (core dumped) on Raspberry Pi 4.0 8gb - pytorch/builder#1370

The release tracker should contain all pull requests relevant to this release, as well as links to related issues.
