A new web crawler launched by Meta last month is quietly scraping the web for AI training data

from Fortune 7 months ago

Meta's new web crawler, Meta External Agent, scrapes publicly available data for AI training, similar to OpenAI’s GPTBot, raising concerns about data usage.
Fortunehttps://fortune.com/2024/08/20/meta-external-agent-new-web-crawler-bot-scrape-data-train-ai-models-llama/

The existence of the Meta External Agent was confirmed by web scraping tracking firms, indicating a shift in how Meta collects data for its AI models.
Fortunehttps://fortune.com/2024/08/20/meta-external-agent-new-web-crawler-bot-scrape-data-train-ai-models-llama/

Meta's spokesman confirmed that the company has been operating crawlers for years but updated its guidance for publishers to exclude their domains from scraping.
Fortunehttps://fortune.com/2024/08/20/meta-external-agent-new-web-crawler-bot-scrape-data-train-ai-models-llama/

The practice of scraping data for AI training has faced backlash from content creators, leading to lawsuits over the unauthorized use of intellectual property.
Fortunehttps://fortune.com/2024/08/20/meta-external-agent-new-web-crawler-bot-scrape-data-train-ai-models-llama/

Read at Fortune

#meta #web-crawler #ai-training #data-scraping #intellectual-property

Collection

[

...

]

A new web crawler launched by Meta last month is quietly scraping the web for AI training dataA new web crawler launched by Meta last month is quietly scraping the web for AI training data Briefly

A new web crawler launched by Meta last month is quietly scraping the web for AI training data
A new web crawler launched by Meta last month is quietly scraping the web for AI training data
Briefly