Pull requests: microsoft/DeepSpeed
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
z3 scaled_global_grad_norm: repalce get_global_norm with torch.norm
#5504
opened May 7, 2024 by
nelyahu
Loading…
Make the quantized data shape compatible with original tensor shape
#5483
opened Apr 30, 2024 by
sfc-gh-reyazda
Loading…
[XPU] support op builder from intel_extension_for_pytorch kernel path
#5425
opened Apr 17, 2024 by
YizhouZ
Loading…
Add fp16 support of Qwen1.5MoE models (A2.7B) to DeepSpeed-FastGen
#5403
opened Apr 12, 2024 by
ZonePG
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.