Skip to content

Pull requests: microsoft/DeepSpeed

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

DeepSpeedCheckpoint: support custom final ln idx
#5506 opened May 8, 2024 by nelyahu Loading…
inference: remove unused _validate_args function
#5505 opened May 8, 2024 by nelyahu Loading…
Fused adam for HPU
#5500 opened May 5, 2024 by BacharL Loading…
Update to ROCm6
#5491 opened May 1, 2024 by loadams Loading…
Fix training of pipeline based peft's lora model
#5477 opened Apr 29, 2024 by xuanhua Loading…
Universal checkpoint for zero stage 3
#5475 opened Apr 29, 2024 by xylian86 Loading…
Add Compressedbackend for Onebit optimizers
#5473 opened Apr 28, 2024 by Liangliang-Ma Loading…
New integration - CometMonitor
#5466 opened Apr 25, 2024 by alexkuzmik Loading…
[Draft][Demo] auto tp training
#5445 opened Apr 22, 2024 by inkcherry Loading…
enable phi2 autotp
#5436 opened Apr 19, 2024 by Yejing-Lai Loading…
enable yuan autotp & add conv tp
#5428 opened Apr 17, 2024 by Yejing-Lai Loading…
uniform deepspeed overflow check
#5424 opened Apr 16, 2024 by GuanhuaWang Loading…
Adding DS Feature API in accelerator
#5423 opened Apr 16, 2024 by duli2012 Loading…
Optimize zero3 fetch params using all_reduce
#5420 opened Apr 16, 2024 by deepcharm Loading…
CPUAdam fp16 and bf16 support
#5409 opened Apr 14, 2024 by BacharL Loading…
Rocm warp size fix
#5402 opened Apr 11, 2024 by rraminen Loading…
rocblas -> hipblas changes for ROCm
#5401 opened Apr 11, 2024 by rraminen Loading…
ProTip! Adding no:label will show everything without a label.