Outlets like The Guardian and The New York Times are scrutinizing digital archives as potential backdoors for AI crawlers.
The Internet Archive can now only crawl Reddit's homepage. Reddit's goal is to block AI firms from scraping Reddit user data. Publishers (and others) are suing AI companies for copyright infringement.