Reddit Blocks Internet Archive to End Sneaky AI Scraping

Reddit blocks Internet Archive from indexing most content to prevent AI data scraping and seeks to monetize data access amid ongoing disputes with AI firms, company statements show.

  • Reddit has announced that it will start blocking bots from The Internet Archive's Wayback Machine due to concerns about AI projects accessing Reddit content from this resource.
  • The Internet Archive, which maintains data on 866 billion web pages, plays a valuable role in preserving digital history, but Reddit's move will significantly limit its capacity on this front.
  • Reddit's decision to restrict AI firms' access to its data appears financially motivated: the company hopes to spur more lucrative licensing deals like those struck with OpenAI and Google, which are expected to generate over $200 million in revenue over the next three years.
Insights by Ground AI

42 Articles

ZDNet
Reposted by IT Security News - cybersecurity, infosecurity news
Center

Reddit blocks the Internet Archive from crawling its data - here's why

The social media platform is cracking down on backdoor data harvesting.

United States

Bias Distribution

  • 50% of the sources lean Left


Social Media Today broke the news on Monday, August 11, 2025.