Aug 12, 2025 at 8:35 PM

Reddit to block Wayback Machine from indexing its content over AI data scraping concerns

Reddit will block the Internet Archive’s Wayback Machine from indexing most of its content, citing concerns that AI companies are scraping data from archived pages to bypass the platform’s controls. Under the new policy, the Wayback Machine loses access to Reddit post detail pages, user profiles, and comments. Only the Reddit.com homepage will remain available for daily archival.

As a result, the Internet Archive can now capture only basic daily snapshots of trending headlines, without preserving full post content or discussion threads. According to Reddit, some AI companies have used archived pages to scrape Reddit data in violation of the company’s policies. These restrictions will remain until the Internet Archive can better prevent scraping, comply with Reddit's privacy rules, and reliably delete removed content.

Reddit informed the Internet Archive in advance and said the limits would begin ramping up immediately. The move aligns with Reddit’s ongoing efforts to curb bulk data extraction, including 2023 API restrictions and paid data deals with AI and search firms. In 2024 and 2025, Reddit signed agreements with Google and OpenAI, blocked major search engines, and sued Anthropic for alleged continued scraping.

Aug 12, 2025 by Mauricio B. Holguin




Reddit is a social news platform where user-shared content is voted on and organized into various 'subreddits.' This community-driven network enables discussions across diverse topics, potentially bringing content to the front page. Key features include a robust commenting system and voting mechanism. Reddit is rated 2.8, with top alternatives being other social networks and discussion platforms.

External links

Reddit will block the Internet Archive (The Verge)
Reddit blocks non-profit Wayback Machine from archiving the site (9to5Mac)
Reddit is blocking Wayback Machine from archiving users' posts (Mashable)

Comments

Benjamin Brooks

Comment, Aug 18, 2025

honestly i think using a frontend to reddit is a workaround, have to check later

RDF0909

Comment, Aug 14, 2025

-2

Oh no, we won't be preserving all those genuinely retarded opinion posts!

Scheldon Oliveira

Comment, Aug 14, 2025

-1

Funny how this helps Reddit's AI scraping and their compulsion to say bullshit

Sam Lander

Comment, Aug 13, 2025

Only because they are using their own weak AI and want to kill competition. As everyone knows, the Reddit team is all about showing they are socially progressive while being strongly capitalist.

Navi

Comment, Aug 12, 2025

All so they can just scrape data themselves. Fake AI is killing the internet.

1 reply

cntchngusrnme

Comment, Aug 12, 2025

and their bribers

youlk1234

Comment, Aug 12, 2025

Uuh, well start using your own archiving solution. But that's a bummer...
