
The Hypocrisy at the Heart of the AI Industry
Tech companies believe in intellectual property, but not yours.
The Atlantic’s ongoing investigation of the books, videos, and other media used by the world’s most powerful tech companies to train their AI models.

Tech companies believe in intellectual property, but not yours.

Large language models don’t “learn”—they copy. And that could change everything for the tech industry.

At least 15 million videos have been snatched by tech companies.

Meta pirated millions of books to train its AI. Search through them here.

Dialogue from these movies and TV shows has been used by companies such as Apple and Anthropic to train AI systems.

“You shouldn’t have put your content on the internet if you didn’t want it to be on the internet,” Common Crawl’s executive director says.

Inside the data sets training new video-creating tools
Atlantic writers help you wrap your mind around artificial intelligence and a new machine age.

Three possible arguments against the tech company

Meta pirated millions of books to train its AI. Search through them here.

AI programs train on questions they’re later tested on. So how do we know if they’re getting smarter?

Dialogue from these movies and TV shows has been used by companies such as Apple and Anthropic to train AI systems.

The technology might finally bend copyright past the breaking point, upending what it means to have a creative society in the process.

A technical problem known as “memorization” is at the heart of recent lawsuits that pose a significant threat to generative-AI companies.

Nobel-winning authors, Dungeons and Dragons, Christian literature, and erotica all serve as datapoints for the machine.

Use our new search tool to see which authors have been used to train the machines.