Pull requests: microsoft/DeepSpeed
Pull requests list
Refactor universal checkpointing and tensor fragments
#2253
opened Aug 23, 2022 by
tjruwase
MoE - Token dropping for Full Tensor Parallelism
#2235
opened Aug 18, 2022 by
siddharth9820
Ds-inference Int8 support through ZeroQuant technology
#2217
opened Aug 13, 2022 by
RezaYazdaniAminabadi
[deepspeed/autotuner] Bug fix for skipping mbs on gas
#2171
opened Aug 2, 2022 by
rahilbathwal5
[deepspeed/autotuner] Bug fix for binary search for batch size
#2162
opened Aug 1, 2022 by
rahilbathwal5
Small fix in injection module to replace transformer
#2064
opened Jun 28, 2022 by
RezaYazdaniAminabadi
Share a list of weight attributes instead of a single one in TiedLayerSpec API
#2035
opened Jun 21, 2022 by
thomasw21
Fixing several issues in API and kernels to run inference with model-parallelism
#2028
opened Jun 17, 2022 by
RezaYazdaniAminabadi
feat: Support for training with MoE module in PipelineEngine
#1942
opened May 7, 2022 by
shjwudp

