AWS offers GPU power for short AI workloads

AWS is introducing Amazon EC2 Capacity Blocks for ML. The new service gives enterprises easy access to cloud-based GPU compute power for short AI workloads.

Companies that need compute power for short AI workloads can now get it from AWS through Amazon EC2 Capacity Blocks for ML. According to AWS, this saves them from having to purchase that capacity themselves on a longer-term basis. Doing so is usually not cost-effective, because the capacity often sees too little use to make financial sense. In addition, customers cannot always get access to the much-needed Nvidia GPUs for their AI workloads.

Introducing Amazon EC2 Capacity Blocks for ML

With AWS’ new service, customers can solve this problem. The service reserves capacity for customers on hundreds of Nvidia H100 GPUs co-located in Amazon EC2 UltraClusters for high-performance ML workloads.

To use Amazon EC2 Capacity Blocks for ML, users choose the desired cluster size, a future start date and the required duration for their workloads. This gives them predictable access to the GPU resources required for AI projects from the moment the reservation starts. AWS compares the consumption model to a hotel reservation for a specific length of stay, but for GPU instances for AI projects.
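The three choices described above (cluster size, start date and duration) can be sketched as a small data model. This is an illustrative sketch only: the class name `CapacityBlockRequest`, its fields and the default of 8 GPUs per instance are assumptions made for the example, not part of any AWS API.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass(frozen=True)
class CapacityBlockRequest:
    """Hypothetical model of a reservation: size, start date, duration."""

    instance_count: int         # cluster size, counted in instances
    start: datetime             # future start date (must be timezone-aware)
    duration_days: int          # length of the "stay"
    gpus_per_instance: int = 8  # assumption: 8 GPUs per instance

    def __post_init__(self):
        # Reject values that cannot describe a valid reservation.
        if self.instance_count < 1:
            raise ValueError("instance_count must be at least 1")
        if self.duration_days < 1:
            raise ValueError("duration_days must be at least 1")
        if self.start.tzinfo is None:
            raise ValueError("start must be timezone-aware")

    @property
    def end(self) -> datetime:
        """Moment at which the reserved block ends."""
        return self.start + timedelta(days=self.duration_days)

    @property
    def gpu_hours(self) -> int:
        """Total GPU-hours covered by the reservation."""
        return (self.instance_count * self.gpus_per_instance
                * self.duration_days * 24)


# Example: 4 instances for 3 days, starting on a date in the future.
request = CapacityBlockRequest(
    instance_count=4,
    start=datetime(2030, 1, 7, 12, 0, tzinfo=timezone.utc),
    duration_days=3,
)
print(request.end.isoformat())
print(request.gpu_hours)
```

Under these assumptions, the example reservation covers 4 × 8 × 72 = 2,304 GPU-hours. As with a hotel booking, the end of the stay follows directly from the start date and the duration.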


The Amazon EC2 Capacity Blocks service is also akin to an old-fashioned mainframe architecture, according to experts. Those computing environments were once deployed as time-sharing systems that supported hundreds of users simultaneously for different workloads.

Availability and pricing

Existing AWS customers can now reserve GPU capacity with Amazon EC2 Capacity Blocks for ML via the AWS Management Console, CLI or SDK. The service is initially available in the AWS US East (Ohio) region, and more regions will follow in the near future. The service is not cheap, however, according to the released overview.
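For readers who want to try the SDK route, the sketch below shows roughly what a search for available blocks could look like with boto3, the AWS SDK for Python. The helper names `build_offering_query` and `find_offerings` are hypothetical. The `p5.48xlarge` instance type and the parameter names reflect our reading of the EC2 `DescribeCapacityBlockOfferings` operation and should be checked against the current boto3 documentation before use. The region code `us-east-2` corresponds to US East (Ohio).

```python
from datetime import datetime, timezone


def build_offering_query(instance_count, earliest_start, latest_end,
                         duration_hours, instance_type="p5.48xlarge"):
    """Build keyword arguments for a capacity block offering search.

    Hypothetical helper: the keys are assumed to match the parameters
    of the EC2 DescribeCapacityBlockOfferings operation.
    """
    if instance_count < 1:
        raise ValueError("instance_count must be at least 1")
    if latest_end <= earliest_start:
        raise ValueError("latest_end must be after earliest_start")
    return {
        "InstanceType": instance_type,            # assumption: H100-based P5
        "InstanceCount": instance_count,          # cluster size
        "StartDateRange": earliest_start,         # earliest acceptable start
        "EndDateRange": latest_end,               # latest acceptable end
        "CapacityDurationHours": duration_hours,  # required duration
    }


def find_offerings(query, region_name="us-east-2"):
    """Look up matching offerings. Needs boto3 and AWS credentials."""
    import boto3  # imported lazily so the builder above works without it

    ec2 = boto3.client("ec2", region_name=region_name)
    response = ec2.describe_capacity_block_offerings(**query)
    return response.get("CapacityBlockOfferings", [])


# Example: look for a 3-day block for 4 instances in a two-week window.
query = build_offering_query(
    instance_count=4,
    earliest_start=datetime(2030, 1, 7, tzinfo=timezone.utc),
    latest_end=datetime(2030, 1, 21, tzinfo=timezone.utc),
    duration_hours=72,
)
print(query)
```

Only the query is built here. Calling `find_offerings(query)` contacts AWS and requires valid credentials, and actually reserving a block is a separate, billable step that this sketch deliberately leaves out.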

