Nvidia launches GH200 chip for AI inference market

August 11, 2023

1 min.

Nvidia has announced a new chip designed to run artificial intelligence models, the GH200. The chip has the same GPU as the company’s current highest-end AI chip, the H100, but it pairs that GPU with 141 gigabytes of cutting-edge memory, as well as a 72-core ARM central processor.

The GH200 is designed for inference, the process of using AI models to make predictions or generate content. Inference is computationally expensive, and it requires a lot of processing power every time the software runs. Nvidia says that the GH200 will allow for significantly faster inference speeds, which will make it possible to run larger and more complex AI models.

The GH200 is also designed for scale-out, meaning that it can be used in large data centers to power multiple AI models simultaneously. This makes it well-suited for cloud computing providers and other businesses that need to run a large number of AI models.

Nvidia’s announcement comes as the company faces increasing competition in the AI chip market from AMD and Google. AMD recently announced its own AI-oriented chip, the MI300X, which can support 192GB of memory. Google is also developing its own custom AI chips for inference.

The sources for this piece include an article in CNBC.

Tags
nvidia

TND Newsdesk

SUBSCRIBE NOW

Become a member

New, Relevant Tech Stories. Our article selection is done by industry professionals. Our writers summarize them to give you the key takeaways

Subscribe Now

North Korean hacker infiltrates US security vendor, loads malware

CrowdStrike releases an update from initial Post Incident Review: Hashtag Trending Special Edition for Thursday July 25, 2024

Security vendor CrowdStrike issues an update from their initial Post Incident Review

CrowdStrike CEO summoned by Homeland Security committee over software disaster

Canadian schools sue social media giants over alleged harm to children

ChatGPT mobile mania: Why users are flocking to ChatGPT Plus

iOS update brings back photos users thought were permanently deleted

Microsoft reveals critical security flaw affecting Android apps

CrowdStrike faces backlash over $10 “apology” voucher

North Korean hacker infiltrates US security vendor, loads malware

Security company accidentally hires a North Korean state hacker: Cybersecurity Today for Friday, July 26, 2024

Security vendor CrowdStrike issues an update from their initial Post Incident Review

Nvidia launches GH200 chip for AI inference market

North Korean hacker infiltrates US security vendor, loads malware

Security company accidentally hires a North Korean state hacker: Cybersecurity Today for Friday, July 26, 2024

CrowdStrike releases an update from initial Post Incident Review: Hashtag Trending Special Edition for Thursday July 25, 2024

Security vendor CrowdStrike issues an update from their initial Post Incident Review

Homeland Security committee demands appearance by CrowdStrike CEO

SUBSCRIBE NOW

Related articles

Target’s new AI is aimed at employees

The good and the bad of AI generated code

Microsoft’s AI success may spell defeat for it’s climate goals

OpenAI’s Chief Scientist Ilya Sutskever Departs Company

Become a member