DeepSpeed is an open source deep learning optimization library for PyTorch.
Library
The library is designed to reduce computing power and memory use and to train large distributed models with better parallelism on existing computer hardware.[2][3] DeepSpeed is optimized for low-latency, high-throughput training. It includes the Zero Redundancy Optimizer (ZeRO) for training models with one trillion or more parameters.[4] Features include mixed-precision training; single-GPU, multi-GPU, and multi-node training; and custom model parallelism. The DeepSpeed source code is licensed under the MIT License and is available on GitHub.[5]
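To illustrate how these features are exposed in practice, the following is a minimal sketch of a DeepSpeed training setup. `deepspeed.initialize` and the `train_batch_size`, `fp16`, and `zero_optimization` configuration keys are part of DeepSpeed's documented interface; the toy model, batch size, and hyperparameter values are illustrative placeholders, not settings from the cited sources.

```python
import torch
import deepspeed

# Hypothetical toy model; any torch.nn.Module can be used.
model = torch.nn.Sequential(
    torch.nn.Linear(1024, 4096),
    torch.nn.ReLU(),
    torch.nn.Linear(4096, 1024),
)

# DeepSpeed is driven by a JSON-style configuration. "fp16" enables
# mixed-precision training; "zero_optimization" selects a ZeRO stage
# (stage 2 partitions optimizer state and gradients across GPUs).
ds_config = {
    "train_batch_size": 32,
    "fp16": {"enabled": True},
    "zero_optimization": {"stage": 2},
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
}

# deepspeed.initialize wraps the model in an engine that manages
# data parallelism, ZeRO partitioning, and loss scaling.
model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)

# One training step: the engine replaces the usual
# loss.backward() / optimizer.step() pair.
inputs = torch.randn(32, 1024, device=model_engine.device,
                     dtype=torch.half)  # fp16 inputs to match the engine
loss = model_engine(inputs).float().pow(2).mean()  # dummy loss
model_engine.backward(loss)
model_engine.step()
```

Such a script is normally launched with DeepSpeed's command-line launcher (for example, `deepspeed train.py`), which sets up the distributed environment; the same configuration scales from a single GPU to multi-node runs without code changes.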
The team claimed to achieve up to a 6.2x throughput improvement, 2.8x faster convergence, and 4.6x less communication.[6]
See also
- Free and open-source software portal
Further reading
- Rajbhandari, Samyam; Rasley, Jeff; Ruwase, Olatunji; He, Yuxiong (2019). "ZeRO: Memory Optimization Towards Training A Trillion Parameter Models". arXiv:1910.02054 [cs.LG].
External links
- AI at Scale - Microsoft Research
- GitHub - microsoft/DeepSpeed
- ZeRO & DeepSpeed: New system optimizations enable training models with over 100 billion parameters - Microsoft Research
References
"Microsoft Updates Windows, Azure Tools with an Eye on The Future". PCMag UK. May 22, 2020. https://uk.pcmag.com/news-analysis/127085/microsoft-updates-windows-azure-tools-with-an-eye-on-the-future ↩
Yegulalp, Serdar (February 10, 2020). "Microsoft speeds up PyTorch with DeepSpeed". InfoWorld. https://www.infoworld.com/article/3526449/microsoft-speeds-up-pytorch-with-deepspeed.html ↩
"Microsoft unveils "fifth most powerful" supercomputer in the world". Neowin. 18 June 2023. https://www.neowin.net/news/microsoft-unveils-fifth-most-powerful-supercomputer-in-the-world/ ↩
"Microsoft trains world's largest Transformer language model". February 10, 2020. https://venturebeat.com/2020/02/10/microsoft-trains-worlds-largest-transformer-language-model/ ↩
"microsoft/DeepSpeed". July 10, 2020 – via GitHub. https://github.com/microsoft/DeepSpeed ↩
"DeepSpeed: Accelerating large-scale model inference and training via system optimizations and compression". Microsoft Research. 2021-05-24. Retrieved 2021-06-19. https://www.microsoft.com/en-us/research/blog/deepspeed-accelerating-large-scale-model-inference-and-training-via-system-optimizations-and-compression/ ↩