Wednesday, October 29, 2025
Bitcoin In Stock
Shop
  • Home
  • Cryptocurrency
  • Bitcoin
  • Altcoin
  • DeFi
  • Market & Analysis
  • More
    • Blockchain
    • Ethereum
    • Dogecoin
    • XRP
    • NFTs
    • Regulations
  • Shop
    • Bitcoin Book
    • Bitcoin Coin
    • Bitcoin Hat
    • Bitcoin Merch
    • Bitcoin Miner
    • Bitcoin Miner Machine
    • Bitcoin Shirt
    • Bitcoin Standard
    • Bitcoin Wallet
  • Legal Hub
Bitcoin In Stock
No Result
View All Result
Home Blockchain

Reddit blocks the Internet Archive from crawling its data – here’s why

by n70products
August 12, 2025
in Blockchain
0
Reddit blocks the Internet Archive from crawling its data – here’s why
189
SHARES
1.5k
VIEWS
Share on FacebookShare on Twitter


gettyimages-2215157577

Andriy Onufriyenko/Getty Photographs

ZDNET’s key takeaways

  • The Web Archive can now solely crawl Reddit’s homepage.
  • Reddit’s purpose is to dam AI companies from scraping Reddit person information.
  • Publishers (and others) are suing AI corporations for copyright infringement.

Reddit is defending its privateness from AI corporations which can be taking roundabout approaches to scraping its content material.

The social media platform, often called a useful resource the place customers can submit anonymously and discover details about nearly any topic, will block the Web Archive’s Wayback Machine from indexing its on-line information, in response to a Monday report from The Verge. The transfer is in response to the invention that AI companies, unable to scrape information from Reddit straight as a result of platform’s prohibitive insurance policies, have as an alternative been retrieving its information from listed content material on the Web Archive and utilizing it to coach fashions.

The Wayback Machine will now solely be capable of scrape information from Reddit’s homepage, in response to The Verge, whereas entry to person profiles, feedback, and submit element pages will probably be blocked.

Launched in 1996, the Web Archive is a non-profit that operates an infinite digital database of net content material. The archive is maintained partly by the Wayback Machine, a chunk of web-crawling software program that gathers net pages and preserves them as they appeared once they have been collected, like digital flies in amber. This serves as a useful resource for researchers learning the evolution of on-line tradition and digital forensic proof for regulation enforcement, amongst different makes use of.

What Reddit’s transfer means

Reddit has beforehand flagged considerations associated to the scraping of its content material with the Web Archive, in response to The Verge. The non-profit was additionally reportedly notified earlier than the web-crawling restrictions began going into impact yesterday.

The Web Archive has but to make an official assertion about the way it plans to reply to Reddit’s new restrictions, and on the time of writing, it has not responded to ZDNET’s request for remark. Wayback Machine director Mark Graham, nevertheless, has advised a number of publications that the Web Archive will “proceed to have ongoing discussions about this matter” with Reddit.

Rising pressure

Reddit’s reported resolution to dam Wayback Machine from scraping nearly all of its content material arrives throughout a second of mounting pressure between AI corporations and digital publishers, although Reddit is the primary tech firm to wade into the controversy. The corporate sued Anthropic in June after discovering that the AI firm was illegally scraping its information, nevertheless it has additionally beforehand signed licensing offers with each Google and OpenAI.

(Disclosure: Ziff Davis, ZDNET’s mother or father firm, filed an April 2025 lawsuit in opposition to OpenAI, alleging it infringed Ziff Davis copyrights in coaching and working its AI programs.) 

AI builders require entry to gargantuan troves of data to coach generative AI fashions, that are designed to establish and replicate delicate mathematical patterns gleaned from these coaching datasets.

Lots of these corporations have scraped coaching information from publicly accessible web sites, together with social media websites and information shops, claiming authorized immunity below an idea recognized in copyright regulation as fair use. (The courts are nonetheless untangling the legitimacy of that argument, and can seemingly be doing so for a while.)

Lots of the organizations whose content material has been copiously scraped — together with a cohort of authors and different artists — have responded with lawsuits. 

Others, in the meantime, have signed content material licensing agreements with the likes of OpenAI, Anthropic, and Google, consenting to the usage of their organizations’ information in trade for elevated visibility within the responses generated by chatbots, or different advantages.





Source link

Tags: ArchiveblockscrawlingdataHeresInternetReddit
  • Trending
  • Comments
  • Latest

Everything announced at Meta Connect 2024: $299 Quest 3S, Orion AR glasses, and more

September 25, 2024

Ethereum turns deflationary: What it means for ETH prices in 2025

October 18, 2024

Ethereum Price Could Still Reclaim $4,000 Based On This Bullish Divergence

February 23, 2025

Uniswap Launches New Bridge Connecting DEX to Base, World Chain, Arbitrum and Others

October 24, 2024

Making the case for Litecoin’s breakout before Bitcoin’s halving

0

Rocket Pool Stands To Reap Big From Ethereum’s Dencun Upgrade, RPL Flying

0

24 Crypto Terms You Should Know

0

Shibarium Breaks The Internet (Again) With Over 400 Million Layer-2 Transactions

0
Bitcoin eyes 6K – Bullish stars align after Fed caution

Bitcoin eyes $116K – Bullish stars align after Fed caution

October 29, 2025
Best early Black Friday Nintendo Switch deals 2025: 20+ sales out early

Best early Black Friday Nintendo Switch deals 2025: 20+ sales out early

October 29, 2025
XRP Price Softens — Momentum Weakness Could Limit Upside In Near Term

XRP Price Softens — Momentum Weakness Could Limit Upside In Near Term

October 29, 2025
Ethereum Whales Double Down On ETH As ,000 Price Target Becomes More Likely

Ethereum Whales Double Down On ETH As $5,000 Price Target Becomes More Likely

October 29, 2025

Recent News

Bitcoin eyes 6K – Bullish stars align after Fed caution

Bitcoin eyes $116K – Bullish stars align after Fed caution

October 29, 2025
Best early Black Friday Nintendo Switch deals 2025: 20+ sales out early

Best early Black Friday Nintendo Switch deals 2025: 20+ sales out early

October 29, 2025

Categories

  • Altcoin
  • Bitcoin
  • Blockchain
  • Blog
  • Cryptocurrency
  • DeFi
  • Dogecoin
  • Ethereum
  • Market & Analysis
  • NFTs
  • Regulations
  • XRP

Recommended

  • Bitcoin eyes $116K – Bullish stars align after Fed caution
  • Best early Black Friday Nintendo Switch deals 2025: 20+ sales out early
  • XRP Price Softens — Momentum Weakness Could Limit Upside In Near Term

© 2024 Bitcoin In Stock | All Rights Reserved

No Result
View All Result
  • Home
  • Cryptocurrency
  • Bitcoin
  • Altcoin
  • DeFi
  • Market & Analysis
  • More
    • Blockchain
    • Ethereum
    • Dogecoin
    • XRP
    • NFTs
    • Regulations
  • Shop
    • Bitcoin Book
    • Bitcoin Coin
    • Bitcoin Hat
    • Bitcoin Merch
    • Bitcoin Miner
    • Bitcoin Miner Machine
    • Bitcoin Shirt
    • Bitcoin Standard
    • Bitcoin Wallet
  • Legal Hub

© 2024 Bitcoin In Stock | All Rights Reserved

Feature

Close the CTA

U.S. Regulated
 

Beginner Friendly
 

Advanced Tools
 

Free Bitcoin Offer
 

Mobile App
 

10$
 

Varies
 

5$
 

Go to mobile version