Five artificial intelligences are challenged to read like humans: the winner was not ChatGPT

Five artificial intelligences are challenged to read like humans: the winner was not ChatGPT
Which artificial intelligence bot understands texts better?
porEditorial Team
Argentina

The Washington Post tested five AIs with real texts. Only one was consistent and outperformed the others in all areas

Compartir:

How well can a chatbot understand what it reads? To find out, a team from the Washington Post tested five of the leading AI bots on the market.

They analyzed everything from novels and scientific papers to political speeches and legal contracts. The results brought surprises among the world's most widely used virtual assistants.

Las decisiones clave seguirán necesitando la aprobación humana
Las decisiones clave seguirán necesitando la aprobación humana

Can AI really understand what it reads?

AI bots promise to be reading superpowers: they summarize contracts, books, or research just by uploading a file. But do they really understand what they're reading, or are they just imitating comprehension?

To answer that question, the Washington Post organized a test with the five most popular chatbots: ChatGPT, Claude, Copilot, Meta AI, and Gemini.

  • Four types of text were used: literature, medical science, legal contracts, and political speeches.
  • The texts were evaluated by experts in each field.
  • They formulated 115 questions to analyze comprehension, critical analysis, and accuracy.

Los riesgos del desarrollo acelerado de la IA
Los riesgos del desarrollo acelerado de la IA

Literature: many failed when reading a historical novel

In the literary area, the bots performed poorly. Only Claude got all the key facts from the book right, while ChatGPT provided the best overall summary, although it omitted characters and themes such as slavery.

Gemini was the worst. The book's author compared it to the "Seinfeld" character who watched the movie instead of reading the novel.

ChatGPT hizo el mejor resumen general
ChatGPT hizo el mejor resumen general

Legal contracts: Claude stood out again

According to Sterling Miller, a corporate lawyer, Claude was the only one who understood the most important clauses well. It even proposed useful improvements and detected details that other bots ignored.

Meanwhile, ChatGPT and Meta AI summarized key parts in a single line, something Miller described as "useless."

Medical research: high performance

All five bots showed an acceptable level when reading scientific papers, perhaps because studies have predictable structures and human-written summaries.

Anthropic lanzó su nueva familia de modelos de inteligencia artificial
Anthropic lanzó su nueva familia de modelos de inteligencia artificial

Claude received the highest score (10/10) for explaining a paper on long COVID. It was clear, technical, and useful for doctors. In contrast, Gemini left out essential parts of the study on Parkinson's.

Politics: ChatGPT understood Trump better

Donald Trump's speeches were the biggest challenge in terms of critical analysis. ChatGPT achieved the best balance between context and accuracy.

Estados Unidos planea redoblar las exportaciones de armas del primer mandato de Trump
Estados Unidos planea redoblar las exportaciones de armas del primer mandato de Trump

Copilot, although technically correct, didn't capture the tone of the speeches.

Claude was the most consistent and took first place

Overall, Claude achieved the best performance. It was the only one that stood out in both scientific analysis and legal writing, and it maintained consistent responses.

Unlike other bots that summarized poorly or ignored key parts, Claude proved to be more complete and accurate. According to the judges, it came closest to being a good real assistant.

En el balance general, Claude logró el mejor desempeño
En el balance general, Claude logró el mejor desempeño

Can we trust these bots to read for us?

Claude and ChatGPT proved to be the most capable, but no bot exceeded 70% overall accuracy. All of them, to a greater or lesser extent, omitted key data or caused misleading answers.

While they can be useful as reading assistants, they still don't replace human comprehension. Many times, it's clear that "the robot hides behind a human mask."


Noticias relacionadas

What is it about the massive new housing tax that is making people flee New York

What is it about the massive new housing tax that is making people flee New York

Gestión Sánchez: a Muslim party is running for the first time in the Andalusian elections

Gestión Sánchez: a Muslim party is running for the first time in the Andalusian elections

Hotesur and Los Sauces: the prosecutor asked to speed up the trial against Cristina and Máximo

Hotesur and Los Sauces: the prosecutor asked to speed up the trial against Cristina and Máximo

While Kicillof is going on a trip, doctors from La Plata announced a 72-hour strike due to the IOMA crisis

While Kicillof is going on a trip, doctors from La Plata announced a 72-hour strike due to the IOMA crisis

After the removal of the stocks, Argentines bought more than USD 31 billion

After the removal of the stocks, Argentines bought more than USD 31 billion

Gatorade eliminated artificial colors and is betting on natural ingredients in the US

Gatorade eliminated artificial colors and is betting on natural ingredients in the US

La Derecha Diario logo
ESX logoInstagram logoYouTube logoTikTok logo
ARGENTINABOLIVIAECUADORISRAELMEXICOURUGUAYDERECHA DIARIO TV
  • ESXInstagramYouTubeTikTok
  • DERECHA DIARIO TV
  • Secciones
  • ARGENTINA
  • BOLIVIA
  • ECUADOR
  • ISRAEL
  • MEXICO
  • URUGUAY
  • Países
  • La Derecha Diario logoLA DERECHA DIARIO
  • La Derecha Diario México logoLA DERECHA DIARIO MÉXICO
  • La Derecha Diario Uruguay logoLA DERECHA DIARIO URUGUAY
  • La Derecha Diario Ecuador logoLA DERECHA DIARIO ECUADOR
  • La Derecha Diario Bolívia logoLA DERECHA DIARIO BOLÍVIA
  • La Derechadiario República Dominicana logoLA DERECHADIARIO REPÚBLICA DOMINICANA
  • La Derecha Diario Israel logoLA DERECHA DIARIO ISRAEL
  • La Derecha Diario Estados Unidos logoLA DERECHA DIARIO ESTADOS UNIDOS
  • Temas
  • GUERRA EN IRÁN
  • JUICIO POR YPF
  • El Diario
  • QUIENES SOMOS
  • AUTORES
  • PUBLICIDAD
  • DONAR
La Derecha Diario logo
TwitterInstagramYouTubeTikTok
Derecha Diario TV

Nosotros

  • Quienes Somos
  • Autores
  • Donar

Privacidad

  • Protección de datos
  • Canales
  • Sitemap

Contacto

  • info@derechadiario.com.ar
PUBLICIDAD