Visions of a larger plunder

aldalire@lemmy.dbzer0.com · 1 year ago

Visions of a larger plunder

db0@lemmy.dbzer0.com · 1 year ago

aldalire@lemmy.dbzer0.com · 1 year ago

Woah, this is awesome work. I’m amazed as usual with the open source community and with people willing to share their computation for this.

Hot Saucerman@lemmy.ml · edit-2 1 year ago

Closed-source AI models.

Books3 corpus would like you to know that all the data in it is from copyrighted books. It has reportedly been widely used in closed-source AI LLMs. “Rules for thee, not for me” shit. They’ll break copyright and then copyright what they made from it.

https://huggingface.co/datasets/the_pile_books3

Books3 is literally everything from the Bibliotik private tracker for books.

So yeah, fuckin roll out the cannons, mateys, let’s sink these hypocritical fuckers.

Even_Adder@lemmy.dbzer0.com · edit-2 1 year ago

You’re allowed to train on copyrighted works, it isn’t illegal for anybody. This article by Kit Walsh does a good job of breaking it down. She’s a senior staff attorney at the EFF.

Hot Saucerman@lemmy.ml · 1 year ago

I didn’t say it was illegal, I said it was hypocritical.

Even_Adder@lemmy.dbzer0.com · 1 year ago

Oh, my bad.

aldalire@lemmy.dbzer0.com · 1 year ago

This has the same vibe as Github (owned by microsoft) training its AI Copilot on repositories under the GPL license, which specifically forbids any work based on it not be made proprietary. Literally a blatant disregard for the license, but it’s ok because it’s a mega-corporation doing it

The£0b°t°m¡§t@lemmy.dbzer0.com · 1 year ago

You are going straight for the One Piece