NYT looks like it’s updated it’s robots.txt file to disallow the Open AI bot from scraping it’s data. Pretty interested to see if they just update their user agent string or if they’ll respect it

  • plz1@lemmy.world
    link
    fedilink
    English
    arrow-up
    17
    ·
    1 year ago

    Updating user agent doesn’t natter unless NYT is actively blocking that, too. Updating robots.txt is purely a “gentleman’s agreement” that OpenAI will respect it. OpenAI would be dumb to ignore it, hat all said, because it’d trigger the lawyer shenanigans to ensue.