Amazon Internet Archive

News publishers limit Internet Archive access due to AI scraping concerns

Outlets like The Guardian and The New York Times are scrutinizing digital archives as potential backdoors for AI crawlers.

ZDNet

Reddit blocks the Internet Archive from crawling its data - here's why

The Internet Archive can now only crawl Reddit's homepage. Reddit's goal is to block AI firms from scraping Reddit user data. Publishers (and others) are suing AI companies for copyright infringement.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

News publishers limit Internet Archive access due to AI scraping concerns

Reddit blocks the Internet Archive from crawling its data - here's why

Trending now