- cross-posted to:
- [email protected]
- cross-posted to:
- [email protected]
The New York Times blocks OpenAI’s web crawler::The New York Times has officially blocked GPTBot, OpenAI’s web crawler. The outlet’s robot.txt page specifically disallows GPTBot, preventing OpenAI from scraping content from its website to train AI models.
You must log in or # to comment.
as if a text file is going to stop them
deleted by creator
NYT also uses a third party bot identification and mitigation service.
The question is: Does that crawler adhere to robot.txt policies?
what is the ai being trained for anyways, how to be a NYT journalist?