AI Watchdog

The Atlantic’s ongoing investigation of the books, videos, and other media used by the world’s most powerful tech companies to train their AI models.

Illustration of burglars behind a castle — Illustration by The Atlantic

The Hypocrisy at the Heart of the AI Industry

Tech companies believe in intellectual property, but not yours.

Alex Reisner

March 20, 2026

Stacked books being crushed into a computer file — Illustration by Matteo Giuseppe Pani / The Atlantic

AI’s Memorization Crisis

Large language models don’t “learn”—they copy. And that could change everything for the tech industry.

Alex Reisner

January 9, 2026

Animated illustration of data sets filled with binders labeled YouTube — Illustration by Matteo Giuseppe Pani / The Atlantic

AI Is Coming for YouTube Creators

At least 15 million videos have been snatched by tech companies.

Alex Reisner

September 10, 2025

Animation of wires going into books — Illustration by Matteo Giuseppe Pani / The Atlantic

The Unbelievable Scale of AI’s Pirated-Books Problem

Meta pirated millions of books to train its AI. Search through them here.

Alex Reisner

March 20, 2025

Animation of movie reels going into folders — Illustration by Matteo Giuseppe Pani / The Atlantic

There’s No Longer Any Doubt That Hollywood Writing Is Powering AI

Dialogue from these movies and TV shows has been used by companies such as Apple and Anthropic to train AI systems.

Alex Reisner

November 18, 2024

Featured Investigations

Illustration of a web browser window forming a hole on the ground — Illustration by Matteo Giuseppe Pani / The Atlantic

The Company Quietly Funneling Paywalled Articles to AI Developers

“You shouldn’t have put your content on the internet if you didn’t want it to be on the internet,” Common Crawl’s executive director says.

Alex Reisner

November 4, 2025

YouTube training data animation — Illustration by Matteo Giuseppe Pani

Search Millions of YouTube Videos Used to Train Generative AI

Inside the data sets training new video-creating tools

Alex Reisner

September 10, 2025

Newsletter

Atlantic Intelligence

Atlantic writers help you wrap your mind around artificial intelligence and a new machine age.

Subject to The Atlantic's Privacy Policy and Terms and Conditions

Latest

Illustrated collage of Hayao Miyazaki and Studio Ghibli art — Illustration by Ben Kothe / The Atlantic. Sources: Frazer Harrison / Getty; Everett Collection.

ChatGPT Turned Into a Studio Ghibli Machine. How Is That Legal?

Three possible arguments against the tech company

Alex Reisner

May 13, 2025

The Unbelievable Scale of AI’s Pirated-Books Problem

Meta pirated millions of books to train its AI. Search through them here.

Alex Reisner

March 20, 2025

Illustration of robotic hands holding Scantron tests — Illustration by The Atlantic. Source: Getty.

Chatbots Are Cheating on Their Benchmark Tests

AI programs train on questions they’re later tested on. So how do we know if they’re getting smarter?

Alex Reisner

March 5, 2025

There’s No Longer Any Doubt That Hollywood Writing Is Powering AI

Dialogue from these movies and TV shows has been used by companies such as Apple and Anthropic to train AI systems.

Alex Reisner

November 18, 2024

Animation of a document being scanned and copied — Illustration by Matteo Giuseppe Pani

Generative AI Is Challenging a 234-Year-Old Law

The technology might finally bend copyright past the breaking point, upending what it means to have a creative society in the process.

Alex Reisner

February 29, 2024

Illustration of a book — Illustration by The Atlantic. Source: Getty.

The Flaw That Could Ruin Generative AI

A technical problem known as “memorization” is at the heart of recent lawsuits that pose a significant threat to generative-AI companies.

Alex Reisner

January 11, 2024

An open book with pages flapping — Video by The Atlantic. Source: Getty.

What I Found in a Database Meta Uses to Train Generative AI

Nobel-winning authors, Dungeons and Dragons, Christian literature, and erotica all serve as datapoints for the machine.

Alex Reisner

September 25, 2023

A mouse cursor clicking on books — Illustration by Joanne Imperio / The Atlantic. Source: Getty.

These 183,000 Books Are Fueling the Biggest Fight in Publishing and Tech

Use our new search tool to see which authors have been used to train the machines.

Alex Reisner

September 25, 2023

Sections

The Print Edition

AI Watchdog

The Hypocrisy at the Heart of the AI Industry

AI’s Memorization Crisis

AI Is Coming for YouTube Creators

The Unbelievable Scale of AI’s Pirated-Books Problem

There’s No Longer Any Doubt That Hollywood Writing Is Powering AI

Featured Investigations

The Company Quietly Funneling Paywalled Articles to AI Developers

Search Millions of YouTube Videos Used to Train Generative AI

Latest

ChatGPT Turned Into a Studio Ghibli Machine. How Is That Legal?

The Unbelievable Scale of AI’s Pirated-Books Problem

Chatbots Are Cheating on Their Benchmark Tests

There’s No Longer Any Doubt That Hollywood Writing Is Powering AI

Generative AI Is Challenging a 234-Year-Old Law

The Flaw That Could Ruin Generative AI

What I Found in a Database Meta Uses to Train Generative AI

These 183,000 Books Are Fueling the Biggest Fight in Publishing and Tech