Pull requests: microsoft/DeepSpeed

Pull requests list

Hybrid Offloading for ZeRO3
#5625 opened Jun 7, 2024 by tohtana Draft
fix: quantization with DeepSpeed HE
#5624 opened Jun 6, 2024 by Atry
Add support for Phi-3 small to FastGen
#5614 opened Jun 4, 2024 by adk9 Draft
fixes in _partition_param_sec function
#5613 opened Jun 4, 2024 by mmhab
[INF] Enable torch compile for inference
#5612 opened Jun 4, 2024 by oelayan7
Upgrade HPU image to v1.16.0.
#5610 opened Jun 4, 2024 by vshekhawat-hlab
Fixed Windows inference build.
#5609 opened Jun 3, 2024 by costin-eseanu
pipe/_exec_backward_pass: fix immediate grad update
#5605 opened Jun 3, 2024 by nelyahu
state_dict_factory: llama checkpoint - support SWIGLU
#5601 opened Jun 2, 2024 by nelyahu
Update profiler.py
#5584 opened May 29, 2024 by gameofdimension
reduce cpu host overhead when using moe
#5578 opened May 29, 2024 by ranzhejiang
Reuse KV cache of prefixes
#5572 opened May 27, 2024 by tohtana Draft
Add support for Microsoft Phi-3 model to DeepSpeed-FastGen
#5559 opened May 21, 2024 by adk9
Add chatglm2 & chatglm3 autotp
#5540 opened May 16, 2024 by Yejing-Lai
Fix deadlock in PipeEngine._exec_recv_grads
#5518 opened May 10, 2024 by i4never
inference: remove unused _validate_args function
#5505 opened May 8, 2024 by nelyahu