AWS promises leading AI performance with its own chips, Nvidia partnership
With the new Trainium 2 and Graviton 4, AWS is revamping its chips, while an expanded collaboration with Nvidia will ensure customers have easier access to the most advanced AI hardware. AWS is making the announcements during its own Re:Invent conference in Las Vegas.
The name of the Trainium 2 ... Read more
Scaleway to offer affordable AI in the cloud with Ampere servers
With AI-based services in demand, their cost will have disappointed many companies. That needs to change, is what Ampere Computing and French cloud provider Scaleway believe. With "cost optimized" (COP) Arm instances, organizations could deploy AI workloads more cost-effectively than with the Nvidi... Read more
GPU shortage drives Fujitsu to make best use of existing hardware
Fujitsu has announced a new technology that makes optimal use of CPUs and GPUs. Processes that have high execution efficiency are given priority. The Japanese company hopes to bail out organizations plagued by the global GPU shortage caused by the ubiquitous AI hype.
Earlier this year, Nvidia st... Read more
AWS offers GPU power for short AI workloads
AWS is introducing Amazon EC2 Capacity Blocks for ML. The new service gives enterprises easy access to cloud-based GPU compute power for short AI workloads.
Companies seeking compute power for short AI workloads can now get it from AWS with Amazon EC2 Capacity Blocks for ML. This, according to A... Read more
Samsung aims to capitalize on AI hype with giant DDR5 modules
Samsung Electronics announced today that it has developed 32-gigabit DDR5 memory on the 12 nanometer process for the first time. This will allow memory modules to house up to 1TB, which should make the chips ideal for AI workloads.
Developments are moving quite fast at Samsung Electronics' devel... Read more
Google launches GKE Enterprise for easier Kubernetes management
Google has unveiled GKE Enterprise, an enterprise version of the Google Kubernetes Engine. It builds on existing initiatives to make cloud management easier, such as Anthos in 2019. GKE further integrates with Cloud TPUs v5e, also newly announced, which specialize in running AI workloads.
As a m... Read more
Intel reveals architecture for Xeon chips slated for 2024
Intel has unveiled details about the architecture of next year's Xeon processors for servers and workstations. While the fifth generation of these chips is not even on the market yet, we're already getting a decent insight into what its successors will bring to the table. Customers have widely vary... Read more
Store your AI workloads in Google Cloud’s customized cloud storage services
Google Cloud is releasing three new cloud storage services for AI workloads. Each service is designed for the needs of a specific AI workload.
AI brings new digital capabilities to businesses but also increases existing storage needs. Like other digital components of a business, AI can be store... Read more
Will IBM’s analog chip take the AI world by storm?
A team at IBM Research has developed a mixed-signal analog chip suitable for AI workloads. The project is still in the research phase, but it is looking promising. While generative AI currently eats up huge hardware requirements and power consumption, an alternative seems to be emerging - but nobod... Read more
Nvidia lets customers rent AI supercomputer with DGX Cloud
Nvidia will be well aware of the huge demand that exists for AI-capable hardware. DGX Cloud will now allow customers to run AI workloads for a monthly rent, avoiding the immense cost of a dedicated AI supercomputer.
Nvidia has an issue that it will be happy to face. Demand for its GPUs is huge b... Read more