Cloudflare reins in AI scraper bots with new Audit panel
Briefly

Rhea says the problem is that the emergence of AI bots has made it more complicated to determine whether programmatic access to a website is beneficial or abusive. While they're not conducting a denial of service attack, bots that capture site data to train AI models or serve AI search results can still present a business threat.
Some customers have already made decisions to negotiate deals directly with AI companies. Many of those contracts include terms about the frequency of scanning and the type of content that can be accessed. We want those publishers to have the tools to measure the implementation of these deals.
AI Data Scraper bots scan the content on your site to train new LLMs. Your material is then put into a kind of blender, mixed up with other content, and used to answer questions from users without attribution or the need for users to visit your site.
As software developer Simon Willison has described it, AI training is akin to 'money laundering for copyrighted data.' Because companies like OpenAI and Anthropic do not disclose the training data used to create their models, AI is essentially content laundering.
Read at Theregister
[
]
[
|
]