Databricks optimizes LLM deployment on Lakehouse platform

Databricks recently released a preview of GPU and LLM optimization for Model Serving. The company hopes this will make it easier to deploy large AI models on the Lakehouse Platform.

The feature, currently in preview, automatically optimizes LLM serving and delivers high performance without manual configuration.

Databricks describes the functionality as the first serverless GPU serving product built on a unified data and AI platform. It should let users develop generative AI solutions within a single platform, from data ingestion to model deployment and monitoring.

The functionality lets users deploy a wide range of AI models, including natural language, computer vision, audio, tabular, and custom models. According to Databricks, it does not matter how the models were trained or on what type of data.

Reduced latency and costs

LLMs deployed via Model Serving are said to achieve up to 3.5 times lower latency, with correspondingly lower costs. Throughput is said to be up to 2.5 times higher.

In the preview, the GPU and LLM optimization in Databricks Model Serving automatically optimizes MPT and Llama 2 models. Support for additional models will be added soon.


