(Reuters) – Social media platform Reddit said on Tuesday it will update a web standard the platform uses to block automated data collection from its website, following reports that AI startups were bypassing the rule to gather content for their systems.
The move comes at a time when artificial intelligence companies are accused of plagiarizing content from publishers to create AI-generated summaries without giving credit or asking permission.
Reddit said it would update the Robots Exclusion Protocol, or “robots.txt,” a widely accepted standard intended to control which parts of a site can be crawled.
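For context, a robots.txt file is simply a plain-text list of directives that a site publishes at its root, telling crawlers which paths they may or may not fetch. The sketch below is purely illustrative; the crawler name and rules are hypothetical and are not Reddit's actual directives.

    # Illustrative robots.txt (hypothetical crawler name and paths, not Reddit's real rules)
    User-agent: ExampleAIBot
    Disallow: /

    User-agent: *
    Allow: /

Compliance with these directives is voluntary on the crawler's part, which is why publishers have also turned to technical enforcement measures such as rate limiting.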
The company also said it will enforce rate limiting, a technique used to cap the number of requests from a given entity, and will block unknown bots and crawlers from collecting and storing data from its website.
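As a rough illustration of how rate limiting works (this is a generic token-bucket sketch, not Reddit's implementation, and the limits and client key are invented for the example):

    import time
    from collections import defaultdict

    # Minimal token-bucket rate limiter sketch (illustrative only).
    # Each client gets `capacity` tokens that refill at `refill_rate`
    # tokens per second; a request is allowed only while tokens remain.
    class TokenBucketLimiter:
        def __init__(self, capacity=60, refill_rate=1.0):
            self.capacity = capacity          # max burst size per client
            self.refill_rate = refill_rate    # tokens added per second
            self.tokens = defaultdict(lambda: capacity)
            self.last_seen = defaultdict(time.monotonic)

        def allow(self, client_id):
            now = time.monotonic()
            elapsed = now - self.last_seen[client_id]
            self.last_seen[client_id] = now
            # Refill tokens in proportion to elapsed time, capped at capacity.
            self.tokens[client_id] = min(
                self.capacity,
                self.tokens[client_id] + elapsed * self.refill_rate,
            )
            if self.tokens[client_id] >= 1:
                self.tokens[client_id] -= 1
                return True
            return False  # over the limit; the request would be rejected

    limiter = TokenBucketLimiter(capacity=60, refill_rate=1.0)
    print(limiter.allow("203.0.113.7"))  # True until the client's bucket is exhausted

In practice, a crawler that exceeds its allowance is throttled or blocked regardless of whether it honors robots.txt.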
More recently, robots.txt has become a key tool that publishers use to prevent tech companies from using their content for free to train AI algorithms and create summaries in response to certain searches.
Last week, a letter to publishers from content licensing startup TollBit said several AI companies were bypassing the web standard to scrape publisher sites.
This follows a Wired investigation that found that AI search startup Perplexity likely evaded efforts to block its web crawler via robots.txt.
Earlier in June, business media publisher Forbes accused Perplexity of plagiarizing its investigative stories for use in generative AI systems without giving credit.
Reddit said Tuesday that researchers and organizations such as the Internet Archive will continue to have access to its content for non-commercial use.