Reddit to update web standard to block automated website scraping

The Hindu

Wednesday, June 26, 2024 04:40:07 AM UTC

Reddit said it will update a web standard used by the platform to block automated data scraping.

Social media platform Reddit said on Tuesday it will update a web standard used by the platform to block automated data scraping from its website, following reports that AI startups were bypassing the rule to gather content for their systems.

The move comes at a time when artificial intelligence firms have been accused of plagiarising content from publishers to create AI-generated summaries without giving credit or asking for permission.

Reddit said that it would update the Robots Exclusion Protocol, or "robots.txt," a widely accepted standard meant to determine which parts of a site are allowed to be crawled.

The company also said it will maintain rate-limiting, a technique used to control the number of requests from one particular entity, and will block unknown bots and crawlers from data scraping - collecting and saving raw information - on its website.

(For top technology news of the day, subscribe to our tech newsletter Today’s Cache)

More recently, robots.txt has become a key tool that publishers employ to prevent tech companies from using their content free-of-charge to train AI algorithms and create summaries in response to some search queries.

Last week, a letter to publishers by the content licensing startup TollBit said that several AI firms were circumventing the web standard to scrape publisher sites.

Read full story on The Hindu

Share this story on:-

Primary Country (Mandatory)

Other Country (Optional)

Set News Language for United States

Set News Language for World

Set News Source for United States

Set News Source for World

Reddit to update web standard to block automated website scraping

The Hindu

Bombay High Court grants bail to person arrested in the Shivaji statue collapse case

A rinderpest outbreak devastated the gaur population of Mudumalai in 1968

An Adheenam that is unique, but not as influential as other Mutts

Upalokayukta sets 20-day deadline to rejuvenate Kelageri Tank in Dharwad

Centre launches Marine Fisheries census, to be completed in 45 days

Lokayukta police raid unearth assets worth ₹26.6 crore

Litigants affected as advocates stayed away from court proceedings in Vellore, nearby districts

Army picks up locals daily, five beaten up in custody: Kishtwar village sarpanch

Karnataka’s software exports cross ₹4.11 lakh cr. during fiscal 2023-24

Continuous rainfall triggers waterlogging in many areas of Thoothukudi

A ‘bribery scheme’ to bag lucrative solar power contracts

Nomadic settlers vacate plot near Kundannoor flyover in Kochi on their own

Jacto-Geo members stage protest condemning murder of teacher

Peddireddi files nomination for PAC chairman post

AI is augmentative technology rather than a replacement: Palanivel Thiaga Rajan

Only Centre has granted permission for tungsten mining at Arittapatti: Forest Minister

Kanakadasa, Purandaradasa are ‘Ashwini Devatas’ of Haridasa literature, says Vidyabhushana

Space, sea should be subjects of ‘universal cooperation’, not universal conflict: PM Modi in Guyana

Matt Gaetz withdraws as Trump’s pick for Attorney General

Kerala CM Pinarayi Vijayan reaches out MPs to exert pressure on Centre to implement key infrastructure projects

Watch: Kenya cancels airport and energy deals with Adani group after U.S. indicts the tycoon

High-level committee formed to draft SoP for road works

KMF enters Delhi-NCR, as Siddaramaiah launches Nandini brand milk products in national capital

Ensure adequate streetlights are installed along road stretch from Fatima College to Samayanallur: HC tells authorities

Virudhunagar district is saving rainwater in new ponds