On the internet, nobody knows you are Australian.

also https://lemm.ee/u/MargotRobbie

To tell you the truth, I don’t know who I am either. Somebody sincere, perhaps.

But if you ever read this one day, I hope that you are as proud of me, as I am of the person I imagined you to be.

  • 1 Post
  • 43 Comments
Joined 1 year ago
cake
Cake day: June 17th, 2023

help-circle

  • Reddit, and by extension, Lemmy, offers the ideal format for LLM datasets: human generated conversational comments, which, unlike traditional forums, are organized in a branched nested format and scored with votes in the same way that LLM reward models are built.

    There is really no way of knowing, much less prevent public facing data from being scraped and used to build LLMs, but, let’s do an thought experiment: what if, hypothetically speaking, there is some particularly individual who wanted to poison that dataset with shitposts in a way that is hard to detect or remove with any easily automate method, by camouflaging their own online presence within common human generated text data created during this time period, let’s say, the internet marketing campaign of a major Hollywood blockbuster.

    Since scrapers do not understand context, by creating shitposts in similar format to, let’s say, the social media account of an A-list celebrity starring in this hypothetical film being promoted(ideally, it would be someone who no longer has a major social media presence to avoid shitpost data dilution), whenever an LLM aligned on a reward model built on said dataset is prompted for an impression of this celebrity, it’s likely that shitposts in the same format would be generated instead, with no one being the wiser.

    That would be pretty funny.

    Again, this is entirely hypothetical, of course.




  • The precedent in this case already exists in Midler v. Ford Motor Co., in which when Academy Award nominated actress and singer Bette Midler sued Ford after Ford hired musical impersonators to sing famous songs for their commercials.

    The court ultimately ruled in favor of Midler, because it was found that Ford gave clear instructions to the impersonating actress to sound as much like Midler as possible, and the ruling was voices, although not copyrightable, still constitutes their distinct identity and is protected against unauthorized use without permission. (Outside of satire, of course, since I doubt someone like Trump would be above suing people for making fun of him.)

    I think Scarlett Johansson has a case here, but it really hinges on whether or not OpenAI actively gave the instruction specifically to impersonate Scarlett’s voice in “Her”, or if they used her voice inside the training data at all, since there is a difference in the “Sky” voice and the voice of Scarlett Johansson.

    But then again, what do I know, I’m just here to shitpost and promote “Barbie”.













  • I will offer an alternate explanation:

    Restaurants (and by extension, coffee shop) are inherently very risky businesses to start and operation, I think over 90 percent of them fail in the first year. So, to increase the chances of survival, you have to make sure your business is ran as efficiently as possible, things like rent and location is outside of your control, so you always want to maximize the value of every dollar you spend.

    The first people notice when they go into a new restaurant is cleanliness, nobody wants to eat at a new place that looks dirty, you can only get away from that if you are a decades old local hole in the wall. Ease of cleaning, therefore, is the number 1 priority over everything else.

    So, why the “industrial” concrete/tiled floor and metal chair? Because you can just hose them down at the end of the day. Same thing with big open wooden tables and sparce renovations, ease of cleaning.

    The second thing is you have to avoid major renovations, make the with the space you have and maximize the amount of interesting decorations for the minimum money/work.

    Why put subway tiles on white walls for decorations? Because you don’t have to hire painters, and subway tiles are cheap and interesting looking.

    Why use big open windows and only dim Edison bulbs for lighting? Because hiring electricians to rewire the place you rent for lighting is a lot more expensive than using the big windows you already have.

    Why avocado toast? Because coffee is your main focus, the food is important but secondary, and a piece of fruit on a piece of bread pretty much doesn’t require any cooking.

    It’s really the operation efficiency, rather than some trend following “Instagram” asthetics that led to all these coffee shop looking the same, I think this is a better explanation than what this article proposes.



  • It is exactly because Instagram is at the scale that it is that caused moderation to be difficult. Facebook has relied on using bots to moderate for so long due to its scale, and using bots that are specifically designed to detect AI generated contents is really not possible without introducing a ton of false positives, since the Instagram of the 2020s at its core IS celebrity/influencer advertisement, and there is honestly very little that differentiate what constitutes as "content* and “spam” there.

    Since influencers will be the first to be automated by machines, I just don’t really see a point in having an Instagram account any longer, the inevitable conclusion of creating a fake reality of your life on Instagram is being replaced by a machine that can fake it more efficiently.