The Wayback Machine - https://web.archive.org/web/20240926192608/https://github.com/microsoft/DeepSpeed/pulls
Skip to content

Pull requests: microsoft/DeepSpeed

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[COMPILE] workflow for deepspeed + torch.compile
#6570 opened Sep 25, 2024 by YizhouZ Loading…
Clean up prefetched parameters
#6557 opened Sep 21, 2024 by tohtana Loading…
Improve consistency of zero_grad
#6554 opened Sep 18, 2024 by tohtana Draft
Enabled Qwen2-MoE Tensor Parallelism (TP) inference
#6551 opened Sep 18, 2024 by gyou2021 Loading…
Fix expert grad scaling problem with ZeRO optimizer
#6546 opened Sep 17, 2024 by wyooyw Loading…
[XPU] Support DeepNVMe new code structure
#6532 opened Sep 13, 2024 by Liangliang-Ma Loading…
Set shuffle=True by default in data_sampler
#6531 opened Sep 13, 2024 by ranzhejiang Loading…
Fix device selection using CUDA_VISIBLE_DEVICES
#6530 opened Sep 12, 2024 by tohtana Loading…
add bfloat16 to inference support dtypes
#6528 opened Sep 12, 2024 by nelyahu Loading…
Fix dynamo issue
#6527 opened Sep 12, 2024 by oraluben Loading…
Handle when backend is also in compile_kwargs
#6502 opened Sep 7, 2024 by oraluben Loading…
Adding the new feature of FPDT
#6462 opened Aug 29, 2024 by YJHMITWEB Loading…
sequence parallel for uneven heads
#6392 opened Aug 21, 2024 by inkcherry Loading…
Add weights_only=True in torch.load
#6094 opened Aug 17, 2024 by terry-for-github Loading…
[NaN check] Add NaN check to support bfloat16.
#5879 opened Aug 8, 2024 by ys950902 Loading…
Fix circular import in ds_transformer.py
#5804 opened Jul 28, 2024 by sznmelvin Loading…
Hybrid Offloading for ZeRO3
#5625 opened Jun 7, 2024 by tohtana Draft
ProTip! What’s not been updated in a month: updated:<2024-08-26.