Everything there is to find on tag: vLLM.
Top story
Red Hat unlocks what’s next with Model-as-a-Service and AgentOps
At Red Hat Summit, the overarching theme is “unlock what’s next.” This naturally encompasses topics lik...
llm-d joins the CNCF
llm-d has been officially accepted as a CNCF Sandbox project. This places the project under the Linux Foundat...
Red Hat launches AI Enterprise for hybrid AI deployments
Red Hat introduces Red Hat AI Enterprise, an integrated platform for deploying and managing models, agents, a...
Red Hat OpenShift 4.20: AI, post-quantum, and broader VM support
Red Hat has released version 4.20 of OpenShift. The solution gets new AI tooling, post-quantum encryption, an...
Top story
Red Hat lays foundation for AI inferencing: Server and llm-d project
This year's annual Red Hat Summit is all about AI inferencing. The open-source company sees a major role for ...
Microsoft expands AKS with RAG functionality and vLLM support
During KubeCon, Microsoft announced that it supports Retrieval Augmented Generation (RAG) in KAITO on Azure K...