Stephen King: My Books Were Used to Train AI

L4sBot@lemmy.world · 2 years ago

Stephen King: My Books Were Used to Train AI

CitizenKong@lemmy.world · 2 years ago

The AI in black fled into the desert and the wordslinger followed.

afraid_of_zombies@lemmy.world · 2 years ago

Don’t worry, a later AI will republish it and it will suck.

The Gunslinger was one of my favorites, before King decided to George Lucas it.

TheFrogThatFlies@lemmy.world · 2 years ago

We need an AI with all human knowledge, or various with different specializations. But those AIs must not be in the hands of companies.

Treczoks@lemmy.world · 2 years ago

Now that might give an AI scary ideas…

chiliedogg@lemmy.world · 2 years ago

deleted by creator

afraid_of_zombies@lemmy.world · 2 years ago

So if your AI responses are biased towards car crashes you will know why now.

Take a Stephen King book you have never read. Open a random page and point to a random paragraph. Do this 3x. You will find a car crash, a memory of a car crash, someone talking about a car crash, or someone concluding X happened because of a car crash.

5BC2E7@lemmy.world · 2 years ago

Does he also have a problem with people that were changed by his books?

ashok36@lemmy.world · 2 years ago

I mean, yeah, duh. Just ask any of them to write a paragraph “in the style of INSERT AUTHOR”.

If it can, then it was trained on that author. I’m not sure how that’s a problem though.

ForgotAboutDre@lemmy.world · 2 years ago

We don’t have the legal framework for this type of thing. So people are going to disagree with how using training data for a commercial AI product should work.

I imagine Steven King would argue they didn’t have licenses or permission to use his books to train their AI. So he should be compensated or the AI deleted/retrained. He would argue buying a copy of the book only lets it be used for humans to read. Similar to buying a CD doesn’t allow you to put that song in your advert.

Turun@feddit.de · 2 years ago

Yes, and all of modern fantasy is heavily influenced by Tolkien’s writing, who in turn took inspiration from old legends like Beowulf.

As if human artists and writers are blind to anything ever created.

sab@lemmy.world · 2 years ago

Humans with imperfect memories being influenced by a work <> AI language models being trained on a work.

Turun@feddit.de · 2 years ago

Sure, if you want to see it like that. But if you try out StableDiffusion, etc you will notice that “imperfect memory” describes the AI as well. You can ask it for famous paintings and it will get the objects and colors generally correct, but only as well as a human artist would. The details will be severely lacking. And that’s the best case scenario for the AI, because famous paintings will be over represented in the training data.

sab@lemmy.world · 2 years ago

Nah.

By default an AI will draw from its entire memory, and so will have lots of different influences. But by tuning your prompt (or restricting your input dataset) you can make it so specific, it’s basically creating near perfect clones. And contrary to a human, it can then produce similar works hundreds of times per minute.

But even that is beside the point. Those works were sold under the presumption that people will read them. Not to ingest them into a LLM or text-to-image model. And now, companies like openai and others profit from the models they trained without permission from the original author. That’s just wrong.

BetaDoggo_@lemmy.world · 2 years ago

Obviously restricting the input will cause the model to overfit, but that’s not an issue for most models where Billions of samples are used. In the case of stable diffusion this paper had a ~0.03% success rate extracting training data after 500 attempts on each image, ~6.23E-5% per generation. And that was on a targeted set with the highest number of duplicates in the dataset.

The reason they were sold doesn’t matter, as long as the material isn’t being redistributed copyright isn’t being violated.

AEsheron@lemmy.world · 2 years ago

Have you used Stable Diffusion. I defy you to make a perfect clone of any image. Take a whole week to try and refine it if you want. It is basically impossible by definition, unless you only trained it on that one image.