Google Cloud unveils updates to Gemini, Imagen, Gemma and Vertex AI

Coen van Eenbergen | April 9, 2024 4:21 pm | July 8, 2024

At Google Cloud Next this year, AI once again takes centre stage. Google Cloud is updating its existing LLM offerings within Vertex AI. For example, Gemini 1.5 Pro is in public preview effective immediately. On top of that, the company is introducing Imagen 2.0 for image generation and has unveiled CodeGemma. Beyond that, there’s even more AI news to report.

Google introduced us to Bard last year. Google believed at the time that it was going to revolutionize AI. Mere months later, the introduction of Gemini dwarfed Bard’s achievements. Today, we’ve already arrived at the (preview) release of Gemini 1.5 Pro. It’s an improved version of Gemini, and its most powerful feature is the huge amount of input that can be put into a single prompt. Gemini 1.5 Pro has an input context window of 1 million tokens. According to Google, that’s more than any other model at the moment.
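To give a feel for what a 1 million token window means in practice, here is a minimal sketch that estimates whether a set of texts would fit. The ratio of roughly four characters per token is an assumption (a common rule of thumb, not Gemini's actual tokenizer), and the function names and the output reserve are hypothetical.

```python
# Back-of-the-envelope check against the 1 million token window quoted above.
CONTEXT_WINDOW_TOKENS = 1_000_000
CHARS_PER_TOKEN = 4  # assumption: rough heuristic, real tokenizers vary


def estimate_tokens(text: str) -> int:
    """Rough token estimate from character count, rounded up."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(texts: list[str], reserve_for_output: int = 8_000) -> bool:
    """True if all texts plus a reserve for the answer fit in the window."""
    total = sum(estimate_tokens(t) for t in texts)
    return total + reserve_for_output <= CONTEXT_WINDOW_TOKENS


# Hypothetical inputs: one 400,000-character report and one 4,000,000-character archive.
report = "x" * 400_000
archive = "x" * 4_000_000
print(fits_in_context([report]))
print(fits_in_context([archive]))
```

For real workloads, an actual token count from the model provider would replace the character-based estimate.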

Imagen 2.0

With Imagen, it was already possible to generate images. As one would expect, Imagen 2.0 does this a lot better, but it can now also generate ‘live’ images from text prompts, with a playback time of 4 seconds. It remains to be seen how organizations can use this effectively. What we find much more important about Imagen 2.0 is the ability to edit images. Users can make adjustments both inside the image and around it, for example by removing or adding elements. It is also possible to add a digital watermark. Organizations such as Shutterstock and Rakuten are already working with Imagen 2.0.

CodeGemma

Within the Gemma family of LLMs, Google is now introducing two versions of CodeGemma: one with 7 billion parameters and one with 2 billion parameters. They can improve existing code, generate complete functions while you program, or simply help by completing code you have started. The CodeGemma 7B model runs in the cloud and works through an integration in the IDE, but the 2B model is small enough to run on a laptop.

The CodeGemma models are trained on datasets with more than 500 billion tokens. They are trained specifically to understand code, more so than the English language. As a result, according to Google, the models are able to generate much more accurate code. The models can handle a variety of programming languages, including Python, JavaScript, Java and more.

Vertex AI improvements

Google Cloud also has improvements in store within Vertex AI. For example, it will be easier to manage prompts. The biggest issue with Vertex AI right now is that experimenting with, migrating and monitoring prompts is cumbersome. With Vertex AI Prompt Management, that’s a thing of the past; you can now share prompts within a team, including versioning, so you can always revert to previous versions. In addition, Google offers the ability to use AI to make suggestions to further improve your prompt.

Users can also place two similar prompts side by side and compare their results to see which prompt performs best. In this way, organizations can more easily take steps forward in prompt engineering.

Next, Google introduced so-called Evaluation Tools, which help users determine which combination of prompt and model produces the best results. This involves looking at how well the model follows the prompt, how much the output still resembles the input, what its quality is, and how much time it takes. In this way, it is easier to determine which model is better.
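The kind of side-by-side comparison described above can be sketched in a few lines. This is not the Vertex AI API: `stub_model` is a hypothetical stand-in for a real model call, the metric functions are deliberately simplistic illustrations of three of the criteria mentioned (instruction following, resemblance to the input, and time taken), and output quality is left out because it needs a human or model-based judge.

```python
import time
from typing import Callable


def follows_instruction(output: str, max_words: int) -> bool:
    """Instruction check: did the output stay within the requested word limit?"""
    return len(output.split()) <= max_words


def overlap_with_input(source: str, output: str) -> float:
    """Fraction of distinct output words that also occur in the source text."""
    out_words = set(output.lower().split())
    if not out_words:
        return 0.0
    src_words = set(source.lower().split())
    return len(out_words & src_words) / len(out_words)


def evaluate(prompt: str, source: str, model: Callable[[str], str], max_words: int) -> dict:
    """Run one prompt against the model and collect the three metrics."""
    start = time.perf_counter()
    output = model(prompt + "\n" + source)
    seconds = time.perf_counter() - start
    return {
        "follows_instruction": follows_instruction(output, max_words),
        "overlap_with_input": overlap_with_input(source, output),
        "seconds": seconds,
    }


def stub_model(text: str) -> str:
    """Hypothetical stand-in for an LLM: echoes the first few source words."""
    instruction, source = text.split("\n", 1)
    limit = 5 if "brief" in instruction.lower() else 12
    return " ".join(source.split()[:limit])


SOURCE = "Google Cloud announced updates to Gemini, Imagen, Gemma and Vertex AI at its Next conference"
PROMPTS = {
    "a": "Summarize in at most 8 words and be brief.",
    "b": "Summarize in at most 8 words.",
}

results = {name: evaluate(p, SOURCE, stub_model, max_words=8) for name, p in PROMPTS.items()}
for name, metrics in results.items():
    print(name, metrics)
```

Swapping `stub_model` for a function that calls a real model would turn this into a small evaluation harness, with the same metrics computed for each prompt and model pair.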

Enterprise Truth, keeping models current with proprietary data

All foundation models are trained on datasets that stop at a certain cutoff date. That means all models are a little older and less current with every passing day. For some applications, that’s a problem because organizations want the ability to work with current data. For that, Google is introducing what it calls Enterprise Truth, where you can ground an existing model with your own data so that it has access to the data you need.

The data you attach to the model is not added to its training dataset. Instead, when a user runs a prompt, the model has that data available while generating the output. For example, should you want to generate a daily summary of a stock market day, it can come in handy if you feed the model the stock prices. However, the data can also be something as simple as your own organization’s knowledge base.
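Conceptually, grounding of this kind means that relevant data is looked up at request time and placed in the prompt, while the model itself stays unchanged. The sketch below illustrates that idea only; it is not Google's Enterprise Truth implementation. The `Document` class, the word-overlap scoring and the example data are all hypothetical, and a real system would use a proper search or embedding index.

```python
from dataclasses import dataclass


@dataclass
class Document:
    title: str
    text: str


def score(query: str, doc: Document) -> int:
    """Naive relevance: number of distinct query words found in the document."""
    query_words = set(query.lower().split())
    doc_words = set((doc.title + " " + doc.text).lower().split())
    return len(query_words & doc_words)


def build_grounded_prompt(query: str, docs: list[Document], top_k: int = 2) -> str:
    """Select the top_k most relevant documents and put them ahead of the question."""
    ranked = sorted(docs, key=lambda d: score(query, d), reverse=True)[:top_k]
    context = "\n".join(f"- {d.title}: {d.text}" for d in ranked)
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context}\n"
        f"Question: {query}"
    )


# Made-up data standing in for a daily price feed and an internal knowledge base.
knowledge_base = [
    Document("Closing prices", "ACME closed at 101.50 and GLOBEX closed at 88.20 today"),
    Document("Travel policy", "Employees book travel through the internal portal"),
    Document("Market note", "ACME rose after its earnings call today"),
]

prompt = build_grounded_prompt("How did ACME shares perform today", knowledge_base)
print(prompt)
```

The resulting prompt would then be sent to the model, which can draw on the attached facts without having been trained on them.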

The updates to the LLMs and Vertex AI are numerous and significant. Google continues to develop new AI solutions and models at a rapid pace, even while many organizations are still working out exactly how to deploy AI.
