American comedian and author Sarah Silverman, along with two other authors, Richard Kadrey and Christopher Golden, has filed lawsuits against Meta Platforms and OpenAI, alleging copyright infringement in connection with Meta’s LLaMA and OpenAI’s ChatGPT.
Meta and OpenAI are alleged to have used the plaintiffs’ content for training their respective artificial intelligence (AI) systems without obtaining any prior permission.
According to the court documents against Meta, many of the plaintiffs’ books under copyright appear in the dataset that “Meta has admitted to using to train LLaMA.”
Similarly, in the case against OpenAI, the lawsuit alleges that ChatGPT’s ability to generate summaries of the plaintiffs’ works indicates that it was trained on copyrighted content.
“The summaries get some details wrong. This is expected since a large language model mixes together expressive material derived from many sources. Still, the rest of the summaries are accurate…”
The suits claim that the companies obtained the copyrighted data from what are known as “shadow libraries,” such as Bibliotik, Library Genesis, Z-Library, and others.
These shadow libraries are websites that use torrent systems to make books “available in bulk,” says the lawsuit. Such sites are illegal, unlike open sources of data such as Project Gutenberg, which collects books whose copyrights have expired.
“These shadow libraries have long been of interest to the AI-training community because of the large quantity of copyrighted material they host.”
In addition to alleging infringement of their own works, the authors filed the complaint on behalf of a class of copyright owners across the United States whose works were also allegedly infringed.
Cointelegraph reached out to OpenAI and Meta for comment on the case, though neither responded prior to publication.
In May, writers across the U.S. who are members of the Writers Guild of America took to the streets in an authorized strike, the first in 15 years, which highlighted many issues facing the industry, including the use of AI.