SI
SI
discoversearch

We've detected that you're using an ad content blocking browser plug-in or feature. Ads provide a critical source of revenue to the continued operation of Silicon Investor.  We ask that you disable ad blocking while on Silicon Investor in the best interests of our community.  If you are not using an ad blocker but are still receiving this message, make sure your browser's tracking protection is set to the 'standard' level.
Technology Stocks : Cloud, edge and decentralized computing

 Public ReplyPrvt ReplyMark as Last ReadFilePrevious 10Next 10PreviousNext  
From: Glenn Petersen11/28/2023 2:20:18 PM
   of 1685
 
Amazon announces new AI chip as it deepens Nvidia relationship

PUBLISHED TUE, NOV 28 202311:41 AM EST
UPDATED 2 HOURS AGO
Jordan Novet @JORDANNOVET
CNBC.com

KEY POINTS
  • Amazon Web Services announced Trainium2, a chip for training artificial intelligence models, and it will also offer access to Nvidia’s next-generation H200 Tensor Core graphics processing units.
  • AWS will host a special computing cluster for Nvidia to use.
  • For now, AWS customers can start testing new general-purpose Graviton4 chips.
Amazon’s AWS cloud unit announced its new Trainium2 artificial intelligence chip and the general-purpose Graviton4 processor during its Reinvent conference in Las Vegas on Tuesday. The company also said it will offer access to Nvidia’s latest H200 AI graphics processing units.

Amazon Web Services is trying to stand out as a cloud provider with a variety of cost-effective options. It won’t just sell cheap Amazon-branded products, though. Just as in its online retail marketplace, Amazon’s cloud will feature top-of-the-line products. Specifically, that means highly sought after GPUs from top AI chipmaker Nvidia.

The dual-pronged approach might put AWS in a better position to go up against its top competitor. Earlier this month Microsoft took a similar dual-pronged approach by revealing its inaugural AI chip, the Maia 100, and also saying the Azure cloud will have Nvidia H200 GPUs.

The Graviton4 processors are based on Arm architecture and consume less energy than chips from Intel or AMD. Graviton4 promises 30% better performance than the existing Graviton3 chips, enabling what AWS said is better output for the price. Inflation has been higher than usual, inspiring central bankers to hike interest rates. Organizations that want to keep using AWS but lower their cloud bills to better deal with the economy might wish to consider moving to Graviton.

More than 50,000 AWS customers are already using Graviton chips. Startup Databricks and Amazon-backed Anthropic, an OpenAI competitor, plan to build models with the new Trainium2 chips, which will boast four times better performance than the original model, Amazon said.

AWS said it will operate more than 16,000 Nvidia GH200 Grace Hopper Superchips, which contain H100 GPUs and Nvidia’s Arm-based general-purpose processors, for Nvidia’s research and development group. Other AWS customers won’t be able to use these chips.

Demand for Nvidia GPUs has skyrocketed since startup OpenAI released its ChatGPT chatbot last year, wowing people with its abilities to summarize information and compose human-like text. It led to a shortage of Nvidia’s chips as companies raced to incorporate similar generative AI technologies into their products.

Normally, the introduction of an AI chip from a cloud provider might present a challenge to Nvidia, but in this case, Amazon is simultaneously expanding its collaboration with Nvidia. At the same time, AWS customers will have another option to consider for AI computing if they aren’t able to secure the latest Nvidia GPUs.

Amazon is the leader in cloud computing but has been renting out GPUs in its cloud for over a decade. In 2018 it followed cloud challengers Alibaba and Google in releasing an AI processor that it developed in-house, giving customers powerful computing at an affordable price.

AWS has launched more than 200 cloud products since 2006, when it released its EC2 and S3 services for computing and storing data. Not all of them have been hits. Some go without updates for a long time and a rare few are discontinued, freeing up Amazon to reallocate resources. However, the company continues to invest in the Graviton and Trainium programs, suggesting that Amazon senses demand.

AWS didn’t announce release dates for virtual-machine instances with Nvidia H200 chips, or instances relying on its Trainium2 silicon. Customers can start testing Graviton4 virtual-machine instances now before they become commercially available in the next few months.


Amazon reveals Trainium2 AI chip while deepening Nvidia relationship (cnbc.com)
Report TOU ViolationShare This Post
 Public ReplyPrvt ReplyMark as Last ReadFilePrevious 10Next 10PreviousNext