A LLM that behaves like a typical Redditor?
What possible use is that?
Air Canada offering a refund of tree fiddy.
You’ll get your refund eventually but first it will try and gaslight you that Air Canada is a woke mind virus before calling you an asshole and then stalking you.
“instead of the $3.50 refund, I’m also authorized to offer you some June 2025 $350 GME calls.”
If it’s trained on the average Reddit reply: $420.69, nice.
I just want to mark the occasion when my previous comment is on 69 points. Noice.
Marketing to terminally online people maybe?
Reddit is a trove of user built content under the guise of community. What Spez did was to say “thanks for all the free work, suckers!”, put a price sticker on it, and laughed all the way to the bank.
And this is why I’m not active on any Internet community anymore.Nevermind, I guess I just can’t help myself…And this is why I’m not active on any Internet community anymore,
you typed.
Active as in “creating meaningful contributions and contributing to the overall knowledge base”. I still shit post from time to time.
This is going to be a really weird thing to argue, but I just casually read through a bunch of your comments and they seem like meaningful contributions.
Well, I guess I can’t help myself… I’ll shitpost more from now on 😅
Somebody asked chat GPT to appear to be a normal internet user to populate the comments section to manufacture content for normal Internet users to respond to so that they can continue building up their training models.
You couldn’t see the sarcasm because it was set to “hidden”.
And that is another unintended example of why all of my post history was purged before migration.
Welcome to the club.
deleted by creator
This is what the 3rd party access to API was really all about.
When API access was allowed , all reddit content was effectively free: They needed to ban 3rd party apps so they could sell the accumulated content. I expect using content to train AI also factors into it.
Considering some of the very wrong and upvoted domain specific knowledge I’ve seen on Reddit over the years I’m not sure the training data is going to be useful for much beyond what every other model can do.
The legal advice in /r/legaladvice was some of the worst garbage I’ve ever seen. I have zero doubt numerous had bad outcomes, at best wasting money and time, at worst spending years in jail because of things that sub told them to say and do. Zero doubt.
I can only assume they are training some specific model for something appearing more human like.
As useless as that will be considering how fucking wildly different we type
Their content?
I wouldn’t be surprised if comments become their intellectual property through some terms of services bullcrap
This is why I don’t blame anyone for editing/deleting their post history on reddit.
The AI:
"IANAL so could you ELI5, so AITA?
THIS."
Ann frankly, I did Nazi that coming.
It’s gonna be trained on everything, even the stuff from 2009, so I’m expecting less of that and more random ‘my fedora chortles intensify’ word salad
Reddit is all bots, porn, ads and political shit posts. Good luck getting any useful training content out of that.
Maybe that’s the point? Training the AI to produce the blabbering bullshit that’s preferred in social media?
They don’t care if the AI produced is useful, they just want to milk as much money from their content as they can.
The API changes were almost certainly just the groundwork for this and I called it at the time. The ridiculous pricing model for API access is because it’s aimed at the hottest tech companies, not third party app developers.
The enshittification continues because it’s what neoliberalism demands. They’ll sell your content and the data they have about you and still show you ads, because that’s the most profitable. Ethics and product quality don’t even enter into it.
“Reddit has given access to YOUR conversations and posts to AI companies.”. FTFY
These were created by people, for peoole, and I will ALWAYS disagree that this data is Reddit’s or any other platforms.
Don’t forget your direct messages aren’t end to end encrypted on Reddit, so now AI will be trained on your craziest “private” conversations
Oh no, all the times I sent or received dodo codes from randos so we could trade animal crossing items. Whatever shall I do?
Edit: I’m gonna leave this here for people to use as a resource against Reddit because it may be worth it to do something actionable.
https://thomashunter.name/posts/2023-06-19-how-to-delete-reddit-account-gdpr-ccpa
now AI will be trained on your craziest “private” conversations
I have no idea what horrible thing this will do to an LLM but I’m kind of curious.
There’s one good news. Reddit didn’t want to pay to move all the old DMs to the new chat infrastructure. So they deleted them.
Pretty sure they just didn’t migrate to the new data structure and didn’t actually delete the raw data. They’re effectively deleted for users but not for Reddit.
With reddits severe bot problem, it’ll be like training on unfiltered sewage. Garbage in, garbage out.
Machines training machines? How perverse!
Damn it. I haven’t deleted my account due to how many people I’ve supported and helped, I stopped using it while ago. It seems I’ll have to.
I wouldn’t bother. They’ll just mark all your stuff DELETED=1 and feed it to their AI anyway.
That’s not a bad idea.
Good thing I scrubbed all of my posts and comments that I could. Fuck that site, straight up and down.
Instead of scrubbing, wordbomb them to screw up any AI training
Oh my sweet summer child
deleted by creator
In before poisoning your comments on Reddit turns into the new protest.
Good. Maybe when it cogitates the things I’ve written it might start offering up some better ideas.
*laughs villainously* This is all going to plan, now there will be some chatbot spewing my insane beliefs