Study Finds Generative AI Lacks Coherent World Understanding, Despite Impressive Capabilities

Share post:

A recent study from researchers at MIT and Harvard suggests that while generative AI models can perform impressive tasks like generating text or navigating cities, they do not necessarily form a coherent understanding of the world. This was demonstrated through experiments showing that AI models, despite accurately providing driving directions in New York City or making valid moves in games like Othello, struggle when conditions change. For example, when streets were closed in New York City, the AI’s accuracy dropped significantly.

The research focused on transformers, the backbone of large language models (LLMs) like GPT-4, which predict the next word or token in a sequence. Despite their predictive power, these models can perform well without truly understanding the underlying rules of tasks, like how streets connect or how game strategies work. The researchers created two new metrics—sequence distinction and sequence compression—to test whether the models could form an accurate world model for tasks like navigating and playing games. Surprisingly, models trained on randomly generated data formed more accurate world models than those trained on data from strategic choices.

The study found that while these AI models can appear to understand tasks, they often rely on incomplete or incorrect internal models, as seen in the case of AI-generated city maps filled with nonexistent streets. The researchers argue that for AI to be useful in more complex, real-world scenarios, scientists need to develop better approaches to ensure that models not only make accurate predictions but also form coherent and reliable world models. 

The research raises important concerns about deploying AI in real-world settings where unexpected changes, like detours or novel challenges, could lead to failures.

Here is a link to the study to find out more.

SUBSCRIBE NOW

Related articles

AWS re:Invent 2024 AI Announcements – Reduced Cost, Increase Accuracy And More

At its re:Invent 2024 conference, Amazon Web Services (AWS) announced two significant advancements aimed at driving down AI...

AI vs Ghost Engineers: Hashtag Trending for Monday, Dec. 2, 2024

Hashtag Trending is brought to you this week by Elisa: A Tale of Quantum Kisses, a science fiction...

AI: What’s Holding You Back? Project Synapse: AI in Action on Hashtag Trending Weekend Edition for November 30, 2024

Exploring AI Security and Strategy | Hashtag Trending Weekend Edition #3 In Episode 4 of our Project Synapse series,...

Cisco AI Readiness Index Highlights Canada’s Growing AI Urgency but Declining Preparedness

Canada’s AI readiness has dipped in 2024, with only 7% of organizations fully prepared to deploy AI, down...

Become a member

New, Relevant Tech Stories. Our article selection is done by industry professionals. Our writers summarize them to give you the key takeaways