Everything there is to find on tag: inference.
Top story
Red Hat progresses AI offerings for accelerated implementation
Red Hat is making progress in expanding its enterprise AI offering. Thanks to validated third-party AI models...
Top story
Red Hat lays foundation for AI inferencing: Server and llm-d project
This year's annual Red Hat Summit is all about AI inferencing. The open-source company sees a major role for ...
Google introduces Ironwood TPU: new powerful AI inference chip
During Google Cloud Next in Las Vegas, Google unveiled its latest Tensor Processing Unit (TPU): Ironwood. Thi...
Rapt AI and AMD want to make AI workloads more efficient on Instinct GPUs
Rapt AI and AMD have announced a strategic partnership to optimize AI workloads on AMD Instinct GPUs. This al...
Nvidia about to acquire cloud startup Lepton AI
Nvidia is busy acquiring the young cloud startup Lepton AI. This would be the second acquisition in a short t...