About

Five Major Publishers Sue Meta for Using Pirated Books to Train Llama AI

Published
Score
13

Why it matters

Five major publishing houses—Elsevier, Cengage, Hachette, Macmillan, and McGraw Hill—filed a class-action lawsuit against Meta Platforms and CEO Mark Zuckerberg on May 5, 2026, in Manhattan federal court. The publishers allege Meta systematically downloaded millions of copyrighted books and journal articles from pirate repositories including LibGen and Anna's Archive to train its Llama generative AI model without authorization or payment. The complaint further charges that Meta stripped copyright-management information from the works to obscure their sources. Author Scott Turow joined as a named plaintiff. The defendants face unspecified monetary damages claims and potential class certification covering broader copyright holders.

The complaint names Zuckerberg personally, alleging he authorized and directed the infringement and abandoned licensing negotiations in favor of using pirated content. The specific terms of those abandoned negotiations and the full scope of Meta's data acquisition remain undisclosed. The case will turn on whether AI training qualifies as fair use under copyright law—a question now pending across multiple lawsuits involving OpenAI, Anthropic, and other AI developers.

Attorneys representing content creators should monitor this case closely. It represents the publishers' most direct challenge yet to AI companies' training practices and signals an aggressive posture on licensing and market protection. Personal liability allegations against Zuckerberg expand the potential exposure beyond corporate entities. The outcome will likely influence settlement negotiations in dozens of pending copyright cases involving authors, news outlets, and visual artists against major AI developers.

mail Subscribe to Intellectual Property email updates

Primary sources. No fluff. Straight to your inbox.

Also on LawSnap