Nvidia launches GH200 chip for AI inference market

Nvidia has announced a new chip designed to run artificial intelligence models, the GH200. The chip has the same GPU as the company’s current highest-end AI chip, the H100, but it pairs that GPU with 141 gigabytes of cutting-edge memory, as well as a 72-core ARM central processor.

The GH200 is designed for inference, the process of using AI models to make predictions or generate content. Inference is computationally expensive because it requires substantial processing power every time the software runs. Nvidia says the GH200 will allow significantly faster inference, making it possible to run larger and more complex AI models.

The GH200 is also designed for scale-out, meaning that it can be used in large data centers to power multiple AI models simultaneously. This makes it well-suited for cloud computing providers and other businesses that need to run a large number of AI models.

Nvidia’s announcement comes as the company faces increasing competition in the AI chip market from AMD and Google. AMD recently announced its own AI-oriented chip, the MI300X, which can support 192GB of memory. Google is also developing its own custom AI chips for inference.

The sources for this piece include a CNBC article.
