AI surpasses human benchmarks in most areas: Stanford report

April 21, 2024

Stanford University’s Institute for Human-Centered Artificial Intelligence (HAI) has published the seventh edition of its annual AI Index report, which finds that artificial intelligence (AI) now outperforms humans in nearly all standard performance tests. The report, authored by an interdisciplinary team of experts from academia and industry, illustrates AI’s rapid advancement and its expanding role across sectors.

The report analyzes multiple aspects of AI integration, from its adoption across different industries to global attitudes towards its economic impact. The highlight, however, is its comparison of AI against human capabilities. AI surpassed human performance in image classification in 2015, basic reading comprehension in 2017, visual reasoning in 2020, and natural language inference in 2021.

The pace at which AI is evolving has rendered many traditional benchmarks obsolete, prompting researchers to propose new metrics that not only measure AI’s competence but also explore the nuanced differences between human and machine intelligence. These new benchmarks aim to identify where humans still hold an advantage over AI systems.

The AI Index report also delves into AI’s capabilities in performing complex cognitive tasks like advanced mathematics and visual commonsense reasoning. For instance, AI’s proficiency in solving competition-level math problems has seen remarkable improvements; a GPT-4-based model solved 84.3% of such problems in 2023, nearing the human baseline of 90%.

Despite these advancements, the report suggests that AI still faces challenges with tasks requiring deep cognitive abilities, such as visual commonsense reasoning (VCR). VCR tests AI’s ability to use contextual knowledge in visual scenarios to make predictions. Here, AI scored 81.60 in 2023, closely trailing the human baseline of 85.
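To put the two comparisons above side by side, here is a minimal sketch that works out how far each AI score sits from its human baseline. It uses only the four figures quoted in this article. The `BenchmarkResult` class and `summarize` function are hypothetical names made up for illustration, and the sketch assumes both scores are on a 0 to 100 scale, which the article does not state explicitly.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BenchmarkResult:
    """One AI-vs-human comparison (hypothetical helper, not from the report)."""
    name: str
    ai_score: float
    human_baseline: float

    def gap(self) -> float:
        # Points by which the AI score trails the human baseline.
        return self.human_baseline - self.ai_score

    def ratio(self) -> float:
        # AI score expressed as a fraction of the human baseline.
        return self.ai_score / self.human_baseline


# Figures as quoted in the article for 2023; assumed to share a 0-100 scale.
RESULTS = [
    BenchmarkResult("competition-level math", 84.3, 90.0),
    BenchmarkResult("visual commonsense reasoning (VCR)", 81.60, 85.0),
]


def summarize(results):
    """Return one human-readable line per benchmark."""
    return [
        f"{r.name}: {r.gap():.1f} points below the human baseline "
        f"({r.ratio():.1%} of it)"
        for r in results
    ]


if __name__ == "__main__":
    for line in summarize(RESULTS):
        print(line)
```

By this rough measure, the math result trails its baseline by 5.7 points and the VCR result by 3.4 points, so both are close to human level but have not reached it.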

The implications of these advancements are profound, signaling a shift in the landscape of employment, privacy, security, and ethical considerations. As AI continues to evolve, it raises crucial questions about the future integration of these technologies in daily human activities and the broader societal impact.

The Stanford report highlights not only AI’s capabilities but also its potential to redefine the boundaries between human and machine contributions to society. Ongoing research and the development of new performance benchmarks are crucial to understanding and responsibly harnessing AI’s full potential.


Jim Love https://www.technewsday.com/
