Google launches Kubernetes support for Cloud Dataproc

Google Cloud Dataproc, a Google Cloud service for running Apache Spark and Hadoop clusters, will be released for Kubernetes. The service ensures that users of Apache Spark and Hadoop don’t have to manage their infrastructure.

Cloud Dataprox will initially be launched as an alpha version on Kubernetes. The aim of the launch is for enterprise organisations to be able to run Apache Spark workloads on Google Kubernetes Engine Clusters. This means that Dataproc users can migrate their workloads to their own data centres, because GKE is almost universally available, through Google Anthos.

Unified management

Apache Spark workloads often run on Hadoop YARN clusters. Cloud Dataproc ensures that users can manage their clusters from a single overview. In this way, it is no longer necessary to use different cluster management systems. “Supporting both YARN and Kubernetes can bring your enterprise the needed flexibility to modernize certain hybrid workloads while continuing to monitor YARN-based workloads,” says Google.

Expansions in the long term

TechCrunch reports that the service so far only supports Apache Spark, but that Google also wants to support other open-source projects in the future. “Enterprises are increasingly looking for products and services that support data processing across multiple locations and platforms,” said Matt Aslett, research vice president at 451 Research. “The launch of Cloud Dataproc on Kubernetes is significant in that it provides customers with a single control plane for deploying and managing Apache Spark jobs on Google Kubernetes Engine in both public cloud and on-premises environments.” In short, the launch of Google Cloud Dataproc is a new step towards supporting the hybrid cloud.

Expert Talks

Whitepapers

Enhance your data protection strategy for 2025

The Data Protection Guide 2025 explores the essential strategies and...

Google launches Kubernetes support for Cloud Dataproc

Unified management

Expansions in the long term

Stay tuned, subscribe!

The AI trailblazer GitHub Copilot is running out of road

Claude’s creator Anthropic overtakes OpenAI at the IPO game

As Fable 5 returns, Anthropic wants to write the frontier AI rulebook

How Google scaled Kubernetes to 130,000 nodes for AI workloads

Why hyperscalers run containers in VMs: VKS deep dive

How AI agents are transforming Salesforce marketing applications

Why enterprises are choosing HPE for private cloud AI

AMD “Helios”: Building rack-scale AI Infrastructure for EMEA Enterprises

Taking the right lessons from AI success stories

Why traditional security can’t protect your enterprise against AI threats

Power critical workloads with all-NVMe active-active storage for non-stop enterprise operations

GOTO Copenhagen 2026

Experience Synology’s latest enterprise backup solution

How to choose the right Enterprise Linux platform?

Enhance your data protection strategy for 2025

Strengthen your cybersecurity with DNS best practices