Pull requests: microsoft/DeepSpeed

Pull requests list

Add support for Microsoft Phi-3 model to DeepSpeed-FastGen
#5559 opened May 21, 2024 by adk9
fix sequence parallel(Ulysses) grad scale for zero0
#5555 opened May 21, 2024 by inkcherry
Add chatglm2 & chatglm3 autotp
#5540 opened May 16, 2024 by Yejing-Lai
Fix deadlock in PipeEngine._exec_recv_grads
#5518 opened May 10, 2024 by i4never
DeepSpeedCheckpoint: support custom final ln idx
#5506 opened May 8, 2024 by nelyahu
inference: remove unused _validate_args function
#5505 opened May 8, 2024 by nelyahu
Update to ROCm6
#5491 opened May 1, 2024 by loadams
Fix training of pipeline based peft's lora model
#5477 opened Apr 29, 2024 by xuanhua
Universal checkpoint for zero stage 3
#5475 opened Apr 29, 2024 by xylian86
Add Compressedbackend for Onebit optimizers
#5473 opened Apr 28, 2024 by Liangliang-Ma
[Draft][Demo] auto tp training
#5445 opened Apr 22, 2024 by inkcherry
enable yuan autotp & add conv tp
#5428 opened Apr 17, 2024 by Yejing-Lai
uniform deepspeed overflow check
#5424 opened Apr 16, 2024 by GuanhuaWang
Adding DS Feature API in accelerator
#5423 opened Apr 16, 2024 by duli2012
fix dist attn reshape error
#5366 opened Apr 5, 2024 by tkdcjf159