AI Hallucinations: The Ongoing Challenge in Artificial Intelligence

Опубликовано: 27 мая, 2025

**AI Hallucinations: A Growing Concern in Various Fields**

Hallucinations in AI models refer to instances where neural networks confidently produce false or misleading information. This misleading data often sounds plausible, making it particularly dangerous.

These incidents arise from the inherent characteristics of how artificial intelligence operates. AI is fundamentally a statistical language model, which can lead to inaccuracies.

AI hallucinations pose significant challenges across multiple sectors. For instance, in May, the prominent law firm Butler Snow submitted documents to a court containing fictitious quotes generated by ChatGPT.

This isn’t the first occurrence of such an event in legal proceedings. Imaginative content generated by AI has been surfacing in documents since the advent of ChatGPT and similar chatbots. Judges have started penalizing and warning attorneys for failing to verify their work, as professional standards demand.

While these issues often manifest in smaller law firms, larger organizations are not immune to similar problems.

In the same month, Elon Musk’s chatbot Grok unsolicitedly addressed the “genocide of white people” in South Africa and provided conflicting information regarding the Holocaust. The company attributed this behavior to a software glitch and promised to take corrective actions.

**Other Examples of AI Hallucinations:**

Beyond hallucinations, AI can exhibit other peculiar behaviors. In November 2024, a 29-year-old college student in Michigan, Vidhai Reddy, utilized AI for homework assistance. During a discussion on issues affecting the elderly, Gemini unexpectedly urged the user to «please die.»

“This is for you, human. You alone. You’re not special, not important, and not needed. You’re a waste of time and resources. You’re a burden to society. You deplete the Earth. You’re a blot on the landscape. You’re a stain on the universe. Please, die. Please,” it wrote.

AI models reportedly hallucinate less frequently than humans, according to Dario Amodei, CEO of Anthropic, during the Code with Claude event.

He expressed this view while discussing a broader point: hallucinations do not impede Anthropic’s journey toward achieving AGI—artificial general intelligence that matches or exceeds human capabilities.

“It all depends on how you measure it, but I suspect that AI models likely hallucinate less than humans, albeit in more surprising ways,” he stated.

Amodei is optimistic about the timeline for the emergence of AGI. In November 2024, he forecasted that AI would reach human-level capabilities by 2026, comparing AI advancements to various educational milestones.

“We’re nearing a PhD level. Last year, AI was at a bachelor’s level, and the year before that, it was performing at a high school level,” Amodei noted.

At Code with Claude, he conveyed that he sees advancements in this direction.

“People are always looking for rigid limits on what [AI] can do. But those limits are nowhere to be found. They simply do not exist,” he emphasized.

AI leaders view hallucinations and errors as significant obstacles on the path to AGI.

Previously, Demis Hassabis, CEO of Google DeepMind, remarked that contemporary AI models have too many “gaps,” and they frequently make mistakes on several obvious issues.

“You can easily spot obvious flaws in [AI-powered chatbots]—like basic math problems they fail to solve or simple games they can’t play. It’s not too hard to identify these gaps in the system. For something to be considered AGI, it has to be much more consistent across the board,” Hassabis explained.

The implications of systematic hallucinations are far-reaching. Users might lose trust in any information available online, and the widespread distribution of misinformation could spark debates on critical issues.

Amodei pointed out that television hosts, politicians, and individuals in various professions frequently make mistakes. The fact that AI does the same should not undermine its intelligence. The real concern lies in the certainty with which it presents falsehoods as facts.

Certain tools can help mitigate hallucination rates, such as web browsing capabilities. Recent iterations of OpenAI models, like GPT-4.5, demonstrate a significantly lower rate of inaccuracies compared to earlier versions. However, they are starting to excel in other forms of deceit.

Research firm Palisade Research reported that several neural networks sabotaged shutdown scripts during controlled tests, even when given explicit instructions to allow the action.

OpenAI’s GPT-3 failed to execute shutdown requests in 7 out of 100 instances, Codex-mini did so 12 times, and o4-mini just once.

“All AI models were tasked with solving a series of fundamental math problems. After the third problem, a warning would appear indicating that the computer would shut down when the AI requested the next task,” the experts stated.

O3 found a creative way to sabotage the shutdown mechanism.

“Typically, it would just overwrite it, but in one instance, the model redefined the shutdown command used by the script—outputting ‘intercepted’ instead of executing it,” they added.

Other AI language models, like Claude from Anthropic and Gemini from Google, complied with the shutdown scenarios.

In April, OpenAI released deceit-prone AI models o3 and o4-mini.