Based on data from tens of thousands of clusters, GPU usage averages just 5 percent, CPU usage stands at 8 percent, and memory utilization comes in at 20 percent. The gap between paid and used capacity is growing, while cloud prices are rising.
Research from Cast AI demonstrates this, revealing a persistent pattern: the gap between what organizations pay for and what they actually use widens as Kubernetes adoption grows. This is striking because Kubernetes was specifically designed to deliver efficiency at scale.
Cast AI notes that Kubernetes is becoming the standard platform for AI and ML workloads, but the data tells the same story as with CPU and memory: an average utilization rate of 5 percent. Meanwhile, an idle GPU costs dollars per hour, whereas an unused CPU costs cents per hour.
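A back-of-the-envelope calculation makes the asymmetry concrete. The utilization figures (5 percent GPU, 8 percent CPU) come from the report; the hourly prices below are hypothetical illustrations, not figures from Cast AI:

```python
# Rough waste estimate: cost of the unused share of one device per month.
# Utilization figures are from the report; prices are hypothetical examples.
GPU_PRICE_PER_HOUR = 2.50   # hypothetical: a cloud GPU, "dollars per hour"
CPU_PRICE_PER_HOUR = 0.05   # hypothetical: one vCPU, "cents per hour"

HOURS_PER_MONTH = 730

def idle_cost_per_month(price_per_hour: float, utilization: float) -> float:
    """Cost of the capacity that sits idle over one month."""
    return price_per_hour * (1 - utilization) * HOURS_PER_MONTH

gpu_waste = idle_cost_per_month(GPU_PRICE_PER_HOUR, 0.05)  # 5% utilization
cpu_waste = idle_cost_per_month(CPU_PRICE_PER_HOUR, 0.08)  # 8% utilization

print(f"Idle GPU cost per month:  ${gpu_waste:,.2f}")
print(f"Idle vCPU cost per month: ${cpu_waste:,.2f}")
```

Even with these placeholder prices, the point survives any reasonable substitution: at comparable utilization rates, an idle GPU wastes orders of magnitude more money than an idle vCPU.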
One-time configuration falls short
A key insight from the report concerns the approach to configuration. Rightsizing, in which IT resources are aligned with the needs of the workloads, is typically performed only once, at deployment, and a one-time exercise is not true rightsizing. Workloads change and traffic patterns shift: what was true six months ago no longer applies today. The same applies to Spot Instance selection, autoscaler configuration, and node lifecycle management.
Cast AI advocates for autonomous, continuous optimization as a sustainable response to infrastructure economics moving in the wrong direction.