Google open-sources AI model for text generation

Google researchers have developed LaserTagger, an open-source model that predicts a sequence of edit operations for converting a source text into a target text. Google claims that this approach to text generation is less error-prone, and that the model is easier to train and faster to run.

The release of LaserTagger follows a number of other Google contributions in the field of language processing and language comprehension. This week, the tech giant presented Meena, a neural network with 2.6 billion parameters that can handle multi-turn dialogue. Earlier this month, Google also published a paper describing Reformer, a model that can process entire novels for translation and text generation.

Overlap between input and output

LaserTagger takes advantage of the fact that in many text generation tasks, the input and the output overlap considerably. For example, when detecting and repairing grammatical errors or when merging multiple sentences, most of the input text can remain unchanged and only a small share of the words needs to change. Rather than generating actual words, LaserTagger therefore predicts a sequence of edit operations: ‘keep’ (which copies the original word to the output), ‘delete’ (which removes the word), and ‘keep-addx’ or ‘delete-addx’ (which adds a phrase x before the tagged word and, in the delete variant, also removes the tagged word).
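The edit operations described above can be illustrated with a short Python sketch. It is not LaserTagger's actual code: the function name `apply_edits`, the tag spelling (`KEEP`, `DELETE`, and a `|phrase` suffix for the add variants) and the choice to place the added phrase before the tagged word are assumptions made for this example.

```python
from typing import List


def apply_edits(tokens: List[str], tags: List[str]) -> List[str]:
    """Apply one edit tag per source token and return the target tokens.

    Hypothetical tag format (an assumption for this sketch):
      "KEEP"          copy the token to the output
      "DELETE"        drop the token
      "KEEP|phrase"   emit the phrase, then copy the token
      "DELETE|phrase" emit the phrase and drop the token
    """
    if len(tokens) != len(tags):
        raise ValueError("expected exactly one tag per token")

    output: List[str] = []
    for token, tag in zip(tokens, tags):
        base, _, phrase = tag.partition("|")
        if phrase:
            # Assumption: the added phrase goes before the tagged word.
            output.extend(phrase.split())
        if base == "KEEP":
            output.append(token)
        elif base != "DELETE":
            raise ValueError(f"unknown tag: {tag!r}")
    return output


# Merging two sentences: only two of the eleven tags are not a plain KEEP.
source = "Turing was born in 1912 . Turing died in 1954 .".split()
tags = ["KEEP", "KEEP", "KEEP", "KEEP", "KEEP", "DELETE|and",
        "DELETE", "KEEP", "KEEP", "KEEP", "KEEP"]
print(" ".join(apply_edits(source, tags)))
```

In this example nine of the eleven source tokens are simply copied, which is the overlap between input and output that the approach relies on.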

The added phrases come from a relatively limited vocabulary, which is optimised to be as small as possible while still covering as many training examples as possible. Because any words added to the target text can only come from that vocabulary, the model cannot insert arbitrary words. This reduces the problem of ‘hallucination’ (producing output that is not supported by the input text). In addition, LaserTagger can predict the edit operations in parallel with high accuracy, which speeds up the entire process compared to models that make their predictions sequentially.
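The trade-off between vocabulary size and training coverage can be sketched as follows. This is a simplified illustration, not the procedure Google uses: the function `select_phrase_vocab`, the frequency-based selection and the toy data are all assumptions made for this example.

```python
from collections import Counter
from typing import List, Set, Tuple


def select_phrase_vocab(added_phrases: List[List[str]], k: int) -> Tuple[Set[str], int]:
    """Keep the k most frequent added phrases and count the covered examples.

    added_phrases holds, per training example, the phrases that have to be
    added to turn the source into the target (hypothetical input format).
    An example counts as covered when all its phrases are in the vocabulary.
    """
    # Count each phrase once per example it appears in.
    counts = Counter(p for example in added_phrases for p in set(example))
    vocab = {phrase for phrase, _ in counts.most_common(k)}
    covered = sum(1 for example in added_phrases if set(example) <= vocab)
    return vocab, covered


# Toy data: phrases that must be added in six training examples.
examples = [["and"], ["and"], ["but"], ["and", "he"], [], ["however ,"]]

for size in (1, 4):
    vocab, covered = select_phrase_vocab(examples, size)
    print(size, sorted(vocab), f"{covered}/{len(examples)} examples covered")
```

A larger vocabulary covers more training examples, but it also gives the model more phrases it is allowed to insert, which is the balance the optimisation aims for.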
