In addition to the GPUs that handle the lion's share of AI training, Nvidia wants to introduce a chip dedicated to running AI workloads in day-to-day use. Drawing on the expertise acquired from startup Groq, the company hopes to provide AI players such as OpenAI with energy-efficient processors for running their AI services.
The chip could be unveiled next month at Nvidia GTC. According to the Wall Street Journal, OpenAI has been given early access to the new processor. On Friday evening, the outlet reported that Nvidia wants to use the chip to thwart emerging competition. In AI training, Nvidia seems unbeatable, although Google with its Tensor Processing Units and AMD with its GPUs are genuine alternatives.
When it comes to inferencing, there are numerous options. AWS and Google Cloud, for example, offer their own chips for this in the public cloud, while many startups are trying to provide cheaper and more efficient alternatives for inferencing. It is therefore no surprise that Nvidia is now developing a dedicated inferencing platform of its own. In December, the company signed a $20 billion licensing deal with Groq, also hiring founder Jonathan Ross and president Sunny Madra. Groq's Language Processing Units (LPUs) are built on a completely new architecture that performs inferencing with significantly lower energy consumption.
Nvidia has not yet announced exactly how it will integrate the technology. GTC 2026 starts on March 16 in San Jose, so we expect more clarity then. It is noteworthy that OpenAI is an early customer: it had been looking for faster alternatives to Nvidia's GPUs for some time, dissatisfied with the inferencing speed for specific tasks such as software development. Last month, it signed a deal with Cerebras for inferencing chips. The two deals are separate.
OpenAI as an early adopter
At the same time, OpenAI received $30 billion from Nvidia last week as part of a mega-investment totaling $110 billion, so peace appears to have been restored between the two parties. OpenAI would like to use the new inferencing chip for Codex, its programming tool that competes with Anthropic's Claude Code. Coding is one of the most profitable use cases for generative AI, and an area in which OpenAI currently ranks second: Claude Code is the de facto standard for programmers working with AI, aside from solutions that companies have developed or purchased internally.