Tag: AI workloads

Here you will find all the articles with the tag: AI workloads.

Hyperscalers’ AI chip capacity is heavily underutilized

Hyperscalers’ AI chip capacity is heavily underutilized

Most of the AI capacity at major cloud providers remains unused. As a result, AWS, Microsoft and Google are missing out on billions of dollars in revenue, TechInsights analyst Owen Rogers asserts. Full utilization of AI hardware, however, is tough to achieve. TechInsights estimates that with sev... Read more

date2 months ago
Scaleway to offer affordable AI in the cloud with Ampere servers

Scaleway to offer affordable AI in the cloud with Ampere servers

With AI-based services in demand, their cost will have disappointed many companies. That needs to change, is what Ampere Computing and French cloud provider Scaleway believe. With "cost optimized" (COP) Arm instances, organizations could deploy AI workloads more cost-effectively than with the Nvidi... Read more

date4 months ago
GPU shortage drives Fujitsu to make best use of existing hardware

GPU shortage drives Fujitsu to make best use of existing hardware

Fujitsu has announced a new technology that makes optimal use of CPUs and GPUs. Processes that have high execution efficiency are given priority. The Japanese company hopes to bail out organizations plagued by the global GPU shortage caused by the ubiquitous AI hype. Earlier this year, Nvidia st... Read more

date4 months ago
AWS offers GPU power for short AI workloads

AWS offers GPU power for short AI workloads

AWS is introducing Amazon EC2 Capacity Blocks for ML. The new service gives enterprises easy access to cloud-based GPU compute power for short AI workloads. Companies seeking compute power for short AI workloads can now get it from AWS with Amazon EC2 Capacity Blocks for ML. This, according to A... Read more

date5 months ago
Samsung aims to capitalize on AI hype with giant DDR5 modules

Samsung aims to capitalize on AI hype with giant DDR5 modules

Samsung Electronics announced today that it has developed 32-gigabit DDR5 memory on the 12 nanometer process for the first time. This will allow memory modules to house up to 1TB, which should make the chips ideal for AI workloads. Developments are moving quite fast at Samsung Electronics' devel... Read more

date7 months ago
Google launches GKE Enterprise for easier Kubernetes management

Google launches GKE Enterprise for easier Kubernetes management

Google has unveiled GKE Enterprise, an enterprise version of the Google Kubernetes Engine. It builds on existing initiatives to make cloud management easier, such as Anthos in 2019. GKE further integrates with Cloud TPUs v5e, also newly announced, which specialize in running AI workloads. As a m... Read more

date7 months ago
Intel reveals architecture for Xeon chips slated for 2024

Intel reveals architecture for Xeon chips slated for 2024

Intel has unveiled details about the architecture of next year's Xeon processors for servers and workstations. While the fifth generation of these chips is not even on the market yet, we're already getting a decent insight into what its successors will bring to the table. Customers have widely vary... Read more

date7 months ago
Will IBM’s analog chip take the AI world by storm?

Will IBM’s analog chip take the AI world by storm?

A team at IBM Research has developed a mixed-signal analog chip suitable for AI workloads. The project is still in the research phase, but it is looking promising. While generative AI currently eats up huge hardware requirements and power consumption, an alternative seems to be emerging - but nobod... Read more

date8 months ago
1 2