Like a snake eating its own tail: What happens when AI consumes its own data? : Short Wave

May Be Interested In:Downton Abbey star’s new period drama based on acclaimed novel gets first look


In large language model collapse, there are generally three sources of errors: The model itself, the way the model is trained and the data — or lack thereof — that the model is trained on.

Andriy Onufriyenko/Getty Images


hide caption

toggle caption

Andriy Onufriyenko/Getty Images


In large language model collapse, there are generally three sources of errors: The model itself, the way the model is trained and the data — or lack thereof — that the model is trained on.

Andriy Onufriyenko/Getty Images

Asked ChatGPT anything lately? Talked with a customer service chatbot? Read the results of Google’s “AI Overviews” summary feature?

If you’ve used the Internet lately, chances are, you’ve been consuming content created by a large language model.

Large language models, like DeepSeek-R1 or OpenAI’s ChatGPT, are kind of like the predictive text feature in your phone on steroids. In order for them to “learn” how to write, these modesl are trained on millions of examples of human-written text.

In the past, this training usually involved having the models read the whole Internet. But nowadays — thanks in part to these large language models themselves — a lot of content on the Internet is written by generative AI.

That means that AI models trained now may consume their own synthetic content — and suffer the consequences.

View the AI-generated images mentioned in this episode.

Have another topic in artificial intelligence you want us to cover? Let us know my emailing [email protected]!

Listen to Short Wave on Spotify and Apple Podcasts.

Listen to every episode of Short Wave sponsor-free and support our work at NPR by signing up for Short Wave+ at plus.npr.org/shortwave.

This episode was produced by Hannah Chinn. It was edited by our showrunner, Rebecca Ramirez. The audio engineer was Jimmy Keeley.

share Share facebook pinterest whatsapp x print

Similar Content

Should we be moving data centers to space?
Should we be moving data centers to space?
Diamond set to become mainstream coolant for AI GPU servers as world’s best thermal conductor promises 25% better overclocking, and 'double performance per watt'
Diamond set to become mainstream coolant for AI GPU servers as world’s best thermal conductor promises 25% better overclocking, and ‘double performance per watt’
Australians Hit With One Cyber Attack Every Second in 2024
Australians Hit With One Cyber Attack Every Second in 2024
Ipso logo
Man Utd two games away from nightmare scenario Ruben Amorim has faced before
Birmingham's Henry Aslikyan captures his third consecutive City Section title
Birmingham’s Henry Aslikyan captures his third consecutive City Section title
BHMM_jason
» Happy ‘Halloween’: The Best Horror-Movie Monsters
On the Horizon: The Stories That Will Shape the World | © 2025 | Daily News