#robotstxt

[ follow ]
Intellectual property law
fromTheregister
1 day ago

Really Simple Licensing spec makes AI orgs pay to scrape

Really Simple Licensing (RSL) 1.0 enables machine-readable rules for crawlers, allowing publishers to declare access, processing, and payment terms for web content.
fromThe Verge
1 day ago

A pay-to-scrape AI licensing standard is now official

An open licensing standard that aims to make AI companies pay for the content they vacuum up across the web is now an official specification. Really Simple Licensing 1.0 - or RSL for short - gives publishers the ability to dictate licensing and compensation rules to the web crawlers that visit their sites. The RSL Collective announced the standard in September with backing from Yahoo, Ziff Davis, and O'Reilly Media.
Artificial intelligence
fromSearch Engine Roundtable
2 days ago

OpenAI Updates Its ChatGPT Crawler OAI-SearchBot

It looks like ChatGPT User (the user action bot) will no longer comply to robots.txt rules (Open changed the wording from "the following robots.txt tags", referring to all 3 user agents to "OAI SearchBot and GPTBot robots.txt tags") OAISearchBot is no longer used to feed the navigational links in ChatGPT answers (blocking this bot does not mean your will not appear in the links)
Artificial intelligence
Artificial intelligence
fromTheregister
2 days ago

Publishers say no to AI scrapers, block bots at server level

Millions of websites are blocking AI crawler bots via robots.txt to prevent training-data scraping and reduce non-human server traffic.
#amazon
fromZDNET
1 week ago
E-Commerce

Amazon puts ChatGPT on the naughty list, blocking shopping access - what we know

fromDigiday
2 weeks ago
E-Commerce

Amazon quietly blocks more of OpenAI's ChatGPT web crawlers from accessing its site

fromZDNET
1 week ago
E-Commerce

Amazon puts ChatGPT on the naughty list, blocking shopping access - what we know

fromDigiday
2 weeks ago
E-Commerce

Amazon quietly blocks more of OpenAI's ChatGPT web crawlers from accessing its site

#cloudflare
fromAdExchanger
1 month ago
Artificial intelligence

Scrapers Gonna Scrape; No More Fast-Forwarding The Ads, DVR Friends | AdExchanger

fromDigiday
2 months ago
Artificial intelligence

Cloudflare updates robots.txt for the AI era - but publishers still want more bite against bots

fromAdExchanger
1 month ago
Artificial intelligence

Scrapers Gonna Scrape; No More Fast-Forwarding The Ads, DVR Friends | AdExchanger

fromDigiday
2 months ago
Artificial intelligence

Cloudflare updates robots.txt for the AI era - but publishers still want more bite against bots

Intellectual property law
fromArs Technica
3 months ago

Pay-per-output? AI firms blindsided by beefed up robots.txt instructions.

RSL enables publishers to declare licensing terms and require compensation from AI crawlers and agents via an automated robots.txt-based protocol.
Artificial intelligence
fromThe Verge
3 months ago

The web has a new system for making AI companies pay up

Really Simple Licensing (RSL) lets web publishers specify licensing and royalty terms in robots.txt and other content to require payment for AI training-data scraping.
Tech industry
fromAdExchanger
3 months ago

Amazon Gets Scraped, Too; LinkedIn Loves Video | AdExchanger

AI companies are crawling Amazon for shopping data, LinkedIn is expanding invite-only video revenue-sharing, and platform competition is generating disputes between Google and Fox.
Intellectual property law
fromTheregister
3 months ago

Asahi, Nikkei sue Perplexity AI for copyright infringement

Perplexity faces a copyright lawsuit from Japan's Nikkei and Asahi alleging unlawful scraping, robots.txt violations, and seeking injunctions plus ¥2.2 billion damages per firm.
E-Commerce
fromDigiday
4 months ago

Shopify has quietly set boundaries for 'buy-for-me' AI bots on merchant sites

Shopify is implementing measures to block agentic AI bots from completing transactions without human review.
Apple
fromSearch Engine Roundtable
7 months ago

Apple Updates Applebot Documentation Explaining Applebot-Extended vs Applebot

Applebot-Extended clarification highlights its role in AI training and implications for web publishers.
[ Load more ]