Google launches VaultGemma: privacy-preserving AI without compromising performance

Google presents VaultGemma, an AI model that protects sensitive data without compromising performance. The 1 billion-parameter model uses differential privacy and will be available as open source.

Google Research and Google DeepMind are behind VaultGemma, a language model that addresses the privacy problems that plague traditional AI. The model builds on Google's Gemma architecture and demonstrates that differential privacy does not necessarily mean reduced performance.

Differential privacy works by adding calibrated noise during training. This mathematically bounds how much any single training example can influence the model, making it practically impossible to retrieve specific records while preserving overall usability. VaultGemma was built from the ground up and trained within a differential-privacy framework to ensure that it cannot memorize or leak sensitive data.
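Differentially private training is commonly implemented as DP-SGD: each example's gradient is clipped to a fixed norm, then Gaussian noise is added to the aggregate. The sketch below illustrates that mechanism with toy numbers; it is not Google's actual training code, and the parameter values are assumptions for illustration.

```python
import numpy as np

def dp_sgd_step(per_example_grads, clip_norm=1.0, noise_multiplier=1.0, rng=None):
    """One differentially private gradient step (DP-SGD sketch).

    Each example's gradient is clipped to a fixed norm so no single
    example can dominate the update; Gaussian noise is then added to
    the sum so individual contributions cannot be recovered from it.
    """
    rng = rng or np.random.default_rng(0)
    clipped = []
    for g in per_example_grads:
        norm = np.linalg.norm(g)
        scale = min(1.0, clip_norm / (norm + 1e-12))  # clip, never amplify
        clipped.append(g * scale)
    total = np.sum(clipped, axis=0)
    noise = rng.normal(0.0, noise_multiplier * clip_norm, size=total.shape)
    return (total + noise) / len(per_example_grads)

# Toy per-example gradients: the first would dominate without clipping.
grads = [np.array([3.0, 4.0]), np.array([0.1, 0.2])]
update = dp_sgd_step(grads)
```

The clipping bound is what lets the noise scale be calibrated: because no example contributes more than `clip_norm`, a fixed amount of noise hides any individual's presence in the batch.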

New scaling laws break through old limitations

Traditional scaling laws for AI models do not apply when differential privacy is applied. Google therefore developed new “DP Scaling Laws” that take into account added noise and larger batch sizes. This breakthrough enables the development of larger and more powerful private language models.

The team adapted the training protocols to counteract the instability caused by the added noise. Private models require batch sizes of millions of examples to train stably. Google found ways to reduce these computational costs without undermining the privacy guarantees.
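The reason huge batches help is simple arithmetic: the Gaussian noise added per step has a fixed scale, so averaging over more examples shrinks the noise relative to the signal. A back-of-the-envelope illustration (the numbers are illustrative, not Google's):

```python
# Per-step DP noise has a fixed standard deviation; after averaging the
# noisy gradient sum over the batch, the residual noise per example
# shrinks linearly with batch size.
clip_norm = 1.0
noise_multiplier = 1.0

def noise_per_example(batch_size):
    # Std of the Gaussian noise after dividing the gradient sum by batch size.
    return noise_multiplier * clip_norm / batch_size

for b in (1_000, 100_000, 1_000_000):
    print(f"batch {b:>9,}: residual noise std {noise_per_example(b):.1e}")
```

Going from a batch of a thousand to a million cuts the effective noise by three orders of magnitude, which is why the DP scaling laws trade larger batches against smaller models and fewer steps.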

Performance comparable to public models

In evaluations on benchmarks such as MMLU and Big-Bench, VaultGemma performs comparably to non-private Gemma models with the same number of parameters. This is remarkable because previous differentially private models always performed significantly worse.

VaultGemma uses a decoder-only transformer architecture with 26 layers and Multi-Query Attention. The sequence length is limited to 1,024 tokens to keep the intensive computational requirements of private training manageable.
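In Multi-Query Attention, every head gets its own query projection, but all heads share a single key head and a single value head, which shrinks the key/value cache that dominates inference memory. A minimal numpy sketch of the idea, with illustrative dimensions rather than VaultGemma's actual ones:

```python
import numpy as np

def multi_query_attention(x, Wq, Wk, Wv, n_heads):
    """Multi-Query Attention sketch: per-head queries, one shared
    key head and one shared value head across all heads."""
    seq, d_model = x.shape
    d_head = d_model // n_heads
    q = (x @ Wq).reshape(seq, n_heads, d_head)  # one query per head
    k = x @ Wk                                  # single shared key head
    v = x @ Wv                                  # single shared value head
    outs = []
    for h in range(n_heads):
        scores = q[:, h, :] @ k.T / np.sqrt(d_head)
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True)  # softmax rows
        outs.append(weights @ v)
    return np.concatenate(outs, axis=-1)

rng = np.random.default_rng(0)
seq, d_model, n_heads = 8, 16, 4
x = rng.normal(size=(seq, d_model))
out = multi_query_attention(
    x,
    rng.normal(size=(d_model, d_model)),            # query projection
    rng.normal(size=(d_model, d_model // n_heads)), # shared key projection
    rng.normal(size=(d_model, d_model // n_heads)), # shared value projection
    n_heads,
)
```

With standard multi-head attention the cache holds `n_heads` key/value pairs per token; here it holds one, a saving that compounds with the compute-heavy private training the article describes.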

Open source for wider adoption

Google is making VaultGemma fully open source via Hugging Face and Kaggle. This contrasts with proprietary models such as Gemini Pro. The new scaling laws should be applicable to much larger private models, potentially up to trillions of parameters. Google envisions collaboration with healthcare providers, with VaultGemma analyzing sensitive patient data without privacy risks.

Because the model cannot reproduce its training data, it also reduces the risks of misinformation and bias amplification, the researchers say.

Tip: Google puts Gemini largely behind a paywall