ad
CURRENT

Microsoft sued by authors over use of books in AI training

26 Jun 2025, 9:55 AM
Microsoft sued by authors over use of books in AI training

NEW YORK, June 26 — Microsoft has been hit with a suit by a group of authors who claim the company used their books without permission to train its Megatron artificial intelligence model.

Kai Bird, Jia Tolentino, Daniel Okrent and several others alleged Microsoft used pirated digital versions of their books to teach its AI to respond to human prompts. Their suit, filed in New York federal court on Tuesday, is one of several high-stakes cases brought by authors, news outlets and other copyright holders against tech companies including Meta Platforms, Anthropic and Microsoft-backed OpenAI over alleged misuse of their material in AI training.

The complaint against Microsoft came a day after a California federal judge ruled that Anthropic made fair use under United States copyright law of authors’ material to train its AI systems but may still be liable for pirating their books. It was the first US decision on the legality of using copyrighted materials without permission for generative AI training.

Microsoft spokespeople did not immediately respond to a request for comment on the suit. An attorney for the authors declined to comment.

The writers alleged in the complaint that Microsoft used a collection of nearly 200,000 pirated books to train Megatron, an algorithm that gives text responses to user prompts. The complaint said Microsoft used the pirated dataset to create a “computer model that is not only built on the work of thousands of creators and authors, but also built to generate a wide range of expression that mimics the syntax, voice, and themes of the copyrighted works on which it was trained”.

Tech companies have argued that they make fair use of copyrighted material to create new, transformative content, and that being forced to pay copyright holders for their work could hamstring the burgeoning AI industry.

The authors requested a court order blocking Microsoft’s infringement and statutory damages of up to US$150,000 (RM634,650) for each work that Microsoft allegedly misused.

— Reuters

Latest
MidRec
About Us

Media Selangor Sdn Bhd, a subsidiary of the Selangor State Government (MBI), is a government media agency. In addition to Selangorkini and SelangorTV, the company also publishes portals and newspapers in Mandarin, Tamil and English.