Nvidia upgrades NeMo Megatron AI toolkit to accelerate AI training

Nvidia has updated its NeMo Megatron AI development toolkit to make AI training faster.

Nvidia announced a new edition of the NeMo Megatron AI toolkit that lets software teams train neural networks faster. In particular, the update promises to cut the time developers need to train sophisticated natural language processing (NLP) models.

GPT-3 and NeMo Megatron

OpenAI LLC, an AI research and development company, introduced an advanced NLP model called ‘Generative Pre-Trained Transformer 3’, or GPT-3. This model can perform various tasks, such as translating text and producing software code. OpenAI offers commercial cloud services that allow organizations to access several specialized GPT-3 editions and create customized versions.

The upgraded NeMo Megatron toolkit includes features that help train GPT-3-style models. Nvidia believes these features will cut the time it takes to train such models by about 30 percent.

“Training can now be done on 175 billion-parameter models using 1,024 NVIDIA A100 GPUs in just 24 days — reducing time to results by 10 days, or some 250,000 hours of GPU computing, prior to these new releases”, researchers stated.
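
The figures in this statement can be checked with simple arithmetic. The sketch below assumes that the earlier training run took 24 + 10 = 34 days, which the quote implies but does not state outright; the variable and function names are illustrative.

```python
# Figures quoted by Nvidia's researchers.
GPUS = 1024          # NVIDIA A100 GPUs used for training
DAYS_NOW = 24        # training time with the new release
DAYS_SAVED = 10      # reduction in time to results
HOURS_PER_DAY = 24


def gpu_hours(gpus: int, days: float) -> float:
    """Total GPU-hours consumed by `gpus` devices running for `days` days."""
    return gpus * days * HOURS_PER_DAY


# Assumption: the previous run took DAYS_NOW + DAYS_SAVED days.
days_before = DAYS_NOW + DAYS_SAVED

saved_gpu_hours = gpu_hours(GPUS, DAYS_SAVED)
time_reduction = DAYS_SAVED / days_before

print(f"GPU-hours saved: {saved_gpu_hours:,.0f}")       # 245,760
print(f"Wall-clock reduction: {time_reduction:.1%}")    # 29.4%
```

Under that assumption, 245,760 GPU-hours rounds to the "some 250,000 hours" in the quote, and a reduction of roughly 29 percent lines up with the 30 percent figure Nvidia cites.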

Key features

This speed-up is due to two key features: ‘selective activation recomputation’ and ‘sequence parallelism’. According to Nvidia, each feature accelerates AI training in a different way.

‘Sequence parallelism’ splits certain layers of the model along the sequence dimension, so that their computations can be spread across multiple GPUs and run in parallel, which increases performance. It also reduces the need to perform the same calculations several times.
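
The idea can be illustrated with a toy example. The sketch below is not NeMo Megatron code; it assumes that an operation such as layer normalization treats each token of a sequence independently, so the sequence can be cut into shards that are processed separately (here one after another, on real hardware on separate GPUs) without changing the result. All function names are hypothetical.

```python
import math


def layer_norm(token, eps=1e-5):
    """Normalize one token vector to zero mean and unit variance."""
    mean = sum(token) / len(token)
    var = sum((v - mean) ** 2 for v in token) / len(token)
    return [(v - mean) / math.sqrt(var + eps) for v in token]


def layer_norm_sequence(tokens):
    """Apply layer norm to every token; no token depends on any other."""
    return [layer_norm(t) for t in tokens]


def split_sequence(tokens, num_devices):
    """Cut the sequence into equal shards (assumes it divides evenly)."""
    chunk = len(tokens) // num_devices
    return [tokens[i * chunk:(i + 1) * chunk] for i in range(num_devices)]


# A toy "sequence" of 8 tokens with 4 features each.
sequence = [[float(i + j) * (j + 1) for j in range(4)] for i in range(8)]

# Reference: one device processes the whole sequence.
full = layer_norm_sequence(sequence)

# Sequence-parallel version: 4 simulated devices each handle 2 tokens.
shards = split_sequence(sequence, num_devices=4)
sharded = [row for shard in shards for row in layer_norm_sequence(shard)]

assert sharded == full  # same result, but each device holds 1/4 of the data
```

Each simulated device only touches a quarter of the tokens, which is why splitting along the sequence dimension spreads both the work and the memory for intermediate results.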

‘Selective activation recomputation’ further reduces the number of calculations. AI models produce intermediate results, known as activations, while processing information. To save memory, these are often discarded during training and recalculated when they are needed again. Rather than recalculating all of them, NeMo Megatron recalculates only a selected part, which reduces training times.
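
A toy sketch can make the trade-off concrete. The code below is not Nvidia's implementation: it assumes a tiny scalar network in which each layer has one "expensive" step (a multiplication standing in for a large matrix product) and one "cheap" step (a tanh). The policy names and helper functions are hypothetical. It compares storing everything, recomputing everything, and recomputing only the cheap step.

```python
import math


def forward(x, weights, policy):
    """Run the toy network, saving activations according to `policy`.

    policy is "store_all", "selective" or "full_recompute".
    Returns the output and the per-layer saved values.
    """
    saved = []
    for w1, w2 in weights:
        h = w1 * x        # stand-in for an expensive operation
        a = math.tanh(h)  # stand-in for a cheap operation
        if policy == "store_all":
            saved.append({"x": x, "h": h, "a": a})
        elif policy == "selective":
            saved.append({"x": x, "h": h})  # drop only what is cheap to rebuild
        else:
            saved.append({"x": x})          # keep only the layer input
        x = w2 * a
    return x, saved


def backward(weights, saved, grad_out=1.0):
    """Return (input gradient, weight gradients, expensive ops recomputed)."""
    expensive_recomputes = 0
    weight_grads = []
    grad = grad_out
    for (w1, w2), s in zip(reversed(weights), reversed(saved)):
        x = s["x"]
        if "h" in s:
            h = s["h"]
        else:
            h = w1 * x                # expensive step has to be redone
            expensive_recomputes += 1
        a = s["a"] if "a" in s else math.tanh(h)  # cheap step redone if dropped
        grad_w2 = grad * a
        grad_h = grad * w2 * (1.0 - a * a)
        weight_grads.append((grad_h * x, grad_w2))
        grad = grad_h * w1
    weight_grads.reverse()
    return grad, weight_grads, expensive_recomputes


weights = [(0.5, 1.5), (-0.8, 0.7), (1.2, -0.3)]
results = {}
for policy in ("store_all", "selective", "full_recompute"):
    y, saved = forward(2.0, weights, policy)
    grad_x, grad_w, recomputed = backward(weights, saved)
    stored = sum(len(s) for s in saved)
    results[policy] = (y, grad_x, grad_w, stored, recomputed)
    print(f"{policy}: stored={stored}, expensive ops recomputed={recomputed}")
```

All three policies produce identical gradients. In this toy setup the selective policy stores fewer values than keeping everything (6 instead of 9) without redoing any expensive step, whereas full recomputation stores the least (3) but repeats every expensive step once.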

“We arrived at the optimal training configuration for a 175B GPT-3 model in under 24 hours”, Nvidia’s researchers said. “Compared with a common configuration that uses full activation recomputation, we achieve a 20 percent to 30 percent throughput speed-up.”
