Upgrade & Secure Your Future with DevOps, SRE, DevSecOps, MLOps!

We spend hours on Instagram and YouTube and waste money on coffee and fast food, but won’t spend 30 minutes a day learning skills to boost our careers.
Master in DevOps, SRE, DevSecOps & MLOps!

Learn from Guru Rajesh Kumar and double your salary in just one year.

Get Started Now!

Microsoft speeds up PyTorch with DeepSpeed

Source: sg.channelasia.tech

Microsoft has released DeepSpeed, a new deep learning optimisation library for PyTorch, that is designed to reduce memory use and train models with better parallelism on existing hardware.

According to a Microsoft Research blog post announcing the new framework, DeepSpeed improves PyTorch model training through a memory optimisation technology that increases the number of possible parameters a model can be trained with, makes better use of the memory local to the GPU, and requires only minimal changes to an existing PyTorch application to be useful.

It’s the minimal impact on existing PyTorch code that has the greatest potential impact. As machine learning libraries grow entrenched, and more applications become dependent on them, there is less room for new frameworks, and more incentive to make existing frameworks more performant and scalable.

PyTorch is already fast when it comes to both computational and development speed, but there’s always room for improvement. Applications written for PyTorch can make use of DeepSpeed with only minimal changes to the code; there’s no need to start from scratch with another framework.

One way DeepSpeed enhances PyTorch is by improving its native parallelism.

In one example, provided by Microsoft in the DeepSpeed documentation, attempting to train a model using PyTorch’s Distributed Data Parallel system across Nvidia V100 GPUs with 32GB of device memory “[ran] out of memory with 1.5 billion parameter models,” while DeepSpeed was able to reach 6 billion parameters on the same hardware.

Another touted DeepSpeed improvement is more efficient use of GPU memory for training. By partitioning the model training across GPUs, DeepSpeed allows the needed data to be kept close at hand, reduces the memory requirements of each GPU, and reduces the communication overhead between GPUs.

A third benefit is allowing for more parameters during model training to improve prediction accuracy. Hyperparameter optimisation, which refers to tuning the parameters or variables of the training process itself, can improve the accuracy of a model but typically at the cost of manual effort and expertise.

To eliminate the need for expertise and human effort, many machine learning frameworks now support some kind of automated hyperparameter optimisation.

With DeepSpeed, Microsoft claims that “deep learning models with 100 billion parameters” can be trained on “the current generation of GPU clusters at three to five times the throughput of the current best system.”

DeepSpeed is available as free open source under the MIT License. Tutorials in the official repo work with Microsoft Azure, but Azure is not required to use DeepSpeed.

Related Posts

Complete Azure Certification Guide & tutorials

What is Azure? Azure is launched by Microsoft in February 2010. It is a cloud computing platform including Infrastructure as a Service (IaaS), Platform as a Service Read More

Read More

Microsoft and VMware Team Up to Offer the Next Evolution of the Azure VMware Solution

Source:-cio Today’s CIOs and enterprise IT departments continue to look for ways to rapidly expand their critical infrastructure to the cloud. The next evolution of Azure VMware Read More

Read More

New Microsoft program to help develop the quantum computing workforce of the future in India

Source:-microsoft Microsoft is creating a new program to build quantum computing skills and capabilities in the academic community in India. As part of this initiative, Microsoft Garage Read More

Read More

DATA SCIENCE AND MACHINE LEARNING PLATFORMS MARKET INSIGHTS BUSINESS OPPORTUNITIES 2027

Source:-primefeed Analysis of COVID-19 Impact on the Data Science and Machine Learning Platforms Market with Key Players Analysis The report highlights the current impact of COVID-19 on Read More

Read More

Microsoft targets its fastest Azure AI instance to date at large neural networks

SOURCE:-siliconangle Microsoft Corp. today previewed a new Azure instance for training artificial intelligence models that targets the emerging class of advanced, ultra-large neural networks being pioneered by Read More

Read More

AtScale Expands Reach of Cloud OLAP with Support for Microsoft Azure Databricks in

Source:-finance.yahoo the intelligent data virtualization provider for advanced analytics, today announced an enhanced platform integration with Azure Databricks. The integration extends AtScale’s Cloud Online Analytical Processing (COLAP) Read More

Read More
Subscribe
Notify of
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments
0
Would love your thoughts, please comment.x
()
x