Releases: microsoft/DeepSpeed
v0.5.9: Patch release
Various small documentation text improvements (#1665) Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
v0.5.8: Patch release
Remove unused import of ssl.OP_ENABLE_MIDDLEBOX_COMPAT (#1601)
v0.5.7: Patch release
Several fixes for our read-the-docs build (#1579)
v0.5.6: Patch release
Fix 1bit extra issue (#1542)
v0.5.5: Patch release
Prevent creation of local temp directory (#1494) Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com> Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
v0.5.4: Patch release
Fix the workspace allocation for the transformer kernel (#1397)
- Fix the workspace allocation for the transformer kernel
- Change the layer-id type and remove one unit test due to OOM
v0.5.3: Patch release
[zero_to_fp32] adapt to 4-bytes alignment in z2 (#1372) Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
v0.5.2: Patch release
Update setup.py (#1361): updated classifiers
v0.5.1: Patch release
Reducing the memory-overhead of creating model for multi-GPU run (#1244) Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
DeepSpeed v0.5.0
- Mixture of Experts (MoE) support
- Curriculum learning