Scientists Train AI to Be Evil, Find They Can’t Reverse It

How hard would it be to train an AI model to be secretly evil? As it turns out, according to Anthropic researchers, not very.

    • TropicalDingdong@lemmy.world
      10 months ago

      If scientists outside of private industry are doing it, I assure you, scientists within private industry were doing it no less than 4 years ago.

      Shits sailed bro. Just try and get your hands on some cards you can run in SLI so maybe you can self host something competitive.

      • BluesF@lemmy.world
        10 months ago

        “Shits sailed”

        Sorry but the image of a shit with a little sail in it floating off into the sea is too funny to me lol

  • AbouBenAdhem@lemmy.world
    10 months ago

    Seems like a weird definition of “evil”. “Selectively inconsistent” might be more accurate.

  • the_q@lemmy.world
    10 months ago

    Is this really that surprising? Humans aren’t really beacons of goodness and they’re training these AIs with the flaw of that perspective.

    • Obinice@lemmy.world
      10 months ago

      What do you mean I’m not a beacon of goodness?! Say that again and I’ll get stabby!!