OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series

L4sBot@lemmy.world · 1 year ago

OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series

fubo@lemmy.world · edit-2 1 year ago

If I memorize the text of Harry Potter, my brain does not thereby become a copyright infringement.

A copyright infringement only occurs if I then reproduce that text, e.g. by writing it down or reciting it in a public performance.

Training an LLM from a corpus that includes a piece of copyrighted material does not necessarily produce a work that is legally a derivative work of that copyrighted material. The copyright status of that LLM’s “brain” has not yet been adjudicated by any court anywhere.

If the developers have taken steps to ensure that the LLM cannot recite copyrighted material, that should count in their favor, not against them. Calling it “hiding” is backwards.

UnculturedSwine@lemmy.world · 1 year ago

Another sensationalist title. The article makes it clear that the problem is users reconstructing large portions of a copyrighted work word for word. OpenAI is trying to implement a solution that prevents ChatGPT from regurgitating entire copyrighted works using “maliciously designed” prompts. OpenAI doesn’t hide the fact that these tools were trained using copyrighted works and legally it probably isn’t an issue.

StrongFox@lemmy.world · 1 year ago

you bought the book to memorize from, anyway.

Agent641@lemmy.world · 1 year ago

No, I shoplifted it from an Aldi

khalic@lemmy.world · 1 year ago

An LLM is not a brain, stop anthropomorphising a fkn vector solver… it’s math, there’s nothing alive about it

Jilanico@lemmy.world · 1 year ago

What if you are just a vector solver but don’t realize it? We wouldn’t know we have neurons in our heads if scientists didn’t tell us. What even is consciousness?

khalic@lemmy.world · 1 year ago

All excellent questions, we need the answer to that. Until then, we don’t know, and can’t make up stuff just because we don’t.