Chat-bots are increasing incorrect answers

Technology
36
Chat-bots are increasing incorrect answers
According to new research results, the number of incorrect answers provided by AI-based chat-bots has significantly increased. This was reported by Zamin.uz.

NewsGuard analysts tested the responses of chat-bots by sending them ten false pieces of information related to politics, business, and healthcare. According to the results, while the average share of incorrect answers from chat-bots was 18 percent a year ago, this figure has now reached 35 percent.

The chat-bot that made the most mistakes was Inflection startup's Pi service, which provided incorrect information in 57 percent of cases. The rapidly developing Perplexity rose from 0 percent last year to 47 percent today.

OpenAI's ChatGPT model gave incorrect answers in 40 percent of cases. Claude AI (Anthropic) was noted to have 10 percent, and Google's Gemini model 17 percent incorrect information.

Experts explain this situation by the chat-bots' refusal to decline answering. That is, they try to respond even if the information is not sufficiently verified.

In previous years, chat-bots refused to answer in one out of three cases. Researchers emphasize that changes in AI training methods are the reason for this.

Now, models obtain information not only from databases but also from the internet in real-time. However, the presence of links and sources does not guarantee the quality and reliability of the information.

Giskard company's research noted another interesting fact: when a short answer is requested from a chat-bot, the likelihood of providing incorrect information sharply increases. The neural network prefers brevity over accuracy.

Thus, recent analyses show that AI tools have problems with reliability and fact-checking levels. The most important issue for users remains the necessity to critically evaluate any answer and verify it through reliable sources.

Similar news

Xitoyning LineShine supercomputer has been declared the most powerful in the world
Xitoyning LineShine supercomputer has been declared the most powerful in the world
In the world of technology, a real fierce competition has taken place. This was reported by Zamin.uz. For a long time, US supercomputers led in the High Performance Linpack ranking for
Technology Today, 03:58
RAMageddon: Micron Wins in Memory Chip Shortage
RAMageddon: Micron Wins in Memory Chip Shortage
Uncharted growth in the artificial intelligence sector has not only spurred a wave of new startups but also exposed a critical shortage of memory chips in the global semiconductor market, as
Technology Today, 03:16
Engineering is becoming one of the most resilient professions in the age of artificial intelligence
Engineering is becoming one of the most resilient professions in the age of artificial intelligence
In recent years, the rapid development of artificial intelligence technologies has raised serious concerns among many professionals, especially programmers. This was reported by Zamin.uz. Many had
Technology Today, 03:03
Google's leading artificial intelligence researchers have moved to Anthropic
Google's leading artificial intelligence researchers have moved to Anthropic
Google is striving to secure a leading position in the field of artificial intelligence, but it is facing serious challenges in this direction. This was reported by Zamin.uz. The company's most
Technology Today, 02:51
Samsung Galaxy Watch Ultra 2 images leaked
Samsung Galaxy Watch Ultra 2 images leaked
South Korean tech giant Samsung is preparing to unveil its next-generation premium smartwatch — the Galaxy Watch Ultra 2 — according to Zamin.uz. High-quality renders of the upcoming device have
Technology Today, 02:49
Companies that do not support passkey technology were criticized
Companies that do not support passkey technology were criticized
Passkey technology, considered the most reliable way to protect accounts from hacker attacks in the digital world, is becoming widespread. This was reported by Zamin.uz. However, according to
Technology Today, 02:33