#data-scraping
#data-scraping

Major hack of Dutch telco Odido was a classic case of social engineering

Phishing and phone-based social engineering allowed attackers to bypass MFA, access Odido's Salesforce accounts, and scrape up to 6.2 million customer records.

#amazon

fromIntelligencer

E-Commerce

The AI Shopping Wars Are Here

Tech industry

Amazon vs Perplexity: the AI agent war has arrived

fromIntelligencer

E-Commerce

The AI Shopping Wars Are Here

Tech industry

Amazon vs Perplexity: the AI agent war has arrived

more#amazon

fromDataBreaches.Net

Judge orders Anna's Archive to delete scraped data; no one thinks it will comply - DataBreaches.Net

The operator of WorldCat won a default judgment against Anna's Archive, with a federal judge ruling yesterday that the shadow library must delete all copies of its WorldCat data and stop scraping, using, storing, or distributing the data. Anna's Archive is a shadow library and search engine for other shadow libraries that was launched in 2022. It archives books and other written materials and makes them available via torrents,

Law

#ai-training-data

fromFuturism

fromIPWatchdog.com | Patents & Intellectual Property Law

Artificial intelligence

After Being Pillaged By AI Companies, Wikipedia Signs Deal to Get Paid By Them

Artificial intelligence

Reddit Dubs Perplexity AI and Data Scraping Companies 'Would-Be Bank Robbers'

fromThe Mercury News

Artificial intelligence

Reddit sues AI company Perplexity and others for 'industrial-scale' scraping of user comments

Artificial intelligence

Reddit drags Perplexity in a new lawsuit, accusing it of building up a $20 billion company off stolen data

fromFuturism

fromIPWatchdog.com | Patents & Intellectual Property Law

Artificial intelligence

After Being Pillaged By AI Companies, Wikipedia Signs Deal to Get Paid By Them

Artificial intelligence

Reddit Dubs Perplexity AI and Data Scraping Companies 'Would-Be Bank Robbers'

fromThe Mercury News

Artificial intelligence

Reddit sues AI company Perplexity and others for 'industrial-scale' scraping of user comments

Artificial intelligence

Reddit drags Perplexity in a new lawsuit, accusing it of building up a $20 billion company off stolen data

more#ai-training-data

#whatsapp

Information security

WhatsApp Tests Usernames as Primary Connection Display

Privacy professionals

Former WhatsApp security boss sues Meta for "systemic cybersecurity failures"

Information security

WhatsApp Tests Usernames as Primary Connection Display

Privacy professionals

Former WhatsApp security boss sues Meta for "systemic cybersecurity failures"

Yes, LinkedIn banned AI agent startup Artisan, but now it's back | TechCrunch

Artisan AI was temporarily banned from LinkedIn over improper use of LinkedIn's name and alleged data scraping, but regained access after addressing the platform's concerns.

Law

Anna's Archive loses .org domain, says suspension likely unrelated to Spotify piracy

Anna's Archive's .org domain appears suspended likely under a court order and faces an OCLC lawsuit alleging it illegally stole 2.2TB of WorldCat data.

#spotify

Music

Pirate library rips 86 million of the most popular songs on Spotify

fromPanamaplaylists

Privacy professionals

Panama Playlists

fromFlowingData

Privacy professionals

Scraping the Spotify playlists of public figures

Music

Pirate library rips 86 million of the most popular songs on Spotify

fromPanamaplaylists

Privacy professionals

Panama Playlists

fromFlowingData

Privacy professionals

Scraping the Spotify playlists of public figures

Google sues data scraping company

Google is suing Serpapi for allegedly using fake queries to bypass protections, scrape copyrighted search-result content, and resell it; Serpapi denies the claims.

fromThe Atlantic

I Am Time Magazine's Person of the Year

For the past two years, my colleague Alex Reisner has investigated precisely how tech companies use massive data sets to train their LLMs. He has repeatedly found that so-called architects of AI have relied heavily on enormous databases of copyrighted work to create chatbots and other programs, and has also found that this work is generally taken without the consent or awareness of its creators: musicians, filmmakers, YouTubers, podcasters, illustrators, writers.

Artificial intelligence

fromTelecompetitor

AT&T sues T-Mobile over "Easy Switch" tool

The price comparison tool within T-Mobile's T-Life app uses AT&T's password-protected software without permission, AT&T told a Texas federal judge on November 30. AT&T is asking for a temporary restraining order. AT&T is accusing T-Mobile of unauthorized scraping of AT&T customer data and says T-Mobile "violates several prohibitions in AT&T's publicly available Terms of Use." It sent a cease-and-desist order to T-Mobile on November 26, but T-Mobile has refused to comply.

US news

#reddit

Intellectual property law

Inside the trap Reddit set for AI startup Perplexity to test whether it was stealing data

Artificial intelligence

Reddit Launches Legal Action to Block AI Companies from Scraping its Data

fromFast Company

Tech industry

Reddit sues Perplexity and others for allegedly scraping millions of user comments

Intellectual property law

Artificial intelligence

Reddit sues Perplexity for allegedly ripping its content to feed AI

Content karma catches up to Reddit in its fight with Anthropic

Reddit is suing Anthropic for allegedly scraping its content to train AI models.

Artificial intelligence

Reddit sues AI company Anthropic for allegedly scraping' user comments to train chatbot

Reddit sues Anthropic for allegedly scraping user comments without consent to train its AI, urging stricter data use limitations.

Intellectual property law

Inside the trap Reddit set for AI startup Perplexity to test whether it was stealing data

Artificial intelligence

Reddit Launches Legal Action to Block AI Companies from Scraping its Data

fromFast Company

Tech industry

Reddit sues Perplexity and others for allegedly scraping millions of user comments

Artificial intelligence

Reddit sues Perplexity for allegedly ripping its content to feed AI

Intellectual property law

Content karma catches up to Reddit in its fight with Anthropic

Artificial intelligence

Reddit sues AI company Anthropic for allegedly scraping' user comments to train chatbot

Reddit sues Anthropic for allegedly scraping user comments without consent to train its AI, urging stricter data use limitations.

Sour Scrapes; (Anti)-trust The Process | AdExchanger

Reddit sues firms for indirectly scraped public data allegedly used to train and sell AI products without licensing.

fromTheregister

Reddit to Perplexity: Get your filthy hands off our forums

Ben Lee, chief legal officer at Reddit, told The Register in an emailed statement that AI companies are desperate for quality content generated by real people and that need is fueling an industrial scale data laundering economy. "Scrapers bypass technological protections to steal data, then sell it to clients hungry for training material," said Lee. "Reddit is a prime target because it's one of the largest and most dynamic collections of human conversation ever created."

Artificial intelligence

Privacy professionals

fromFlowingData

LinkedIn sues company for fake bots

LinkedIn sued ProAPIs and its CEO, alleging they operated millions of fake accounts that scraped and sold member data for up to $15,000 per month.

fromTheregister

Whitebridge AI faces complaint over reputation reports

We fully comply with the GDPR, ensuring your personal data is protected and handled transparently. We only collect publicly available information and you have rights to access, rectify, erase, and restrict processing of your data.

EU data protection

#parking-enforcement

fromsfist.com

Gadgets

Local App-Maker Makes Viral App to Track SF Parking Cops

fromMission Local

Privacy technologies

Want to avoid S.F. parking cops? A 23-year-old's app can help.

fromsfist.com

Gadgets

Local App-Maker Makes Viral App to Track SF Parking Cops

fromMission Local

Privacy technologies

Want to avoid S.F. parking cops? A 23-year-old's app can help.

more#parking-enforcement

fromsfist.com

Asinine Website Ranks SF Restaurant Patrons By Hotness, Using AI

"I scraped millions of Google Maps restaurant reviews, and gave each reviewer's profile picture to an AI model that rates how hot they are out of 10," says San Francisco-based website creater Riley Walz. "This map shows how attractive each restaurant's clientele is. Red means hot, blue means not."

Artificial intelligence

fromThe Atlantic

At Least 15 Million YouTube Videos Have Been Snatched by AI Companies

More than 15.8 million YouTube videos from 2 million channels, including nearly 1 million how-to videos, were downloaded without creators' permission to train AI.

Privacy professionals

fromMashable India

'Meta Is Allowing AI To Read Chats' Paytm Boss's Privacy Warning Gets Flagged: ''Don't Fall For WhatsApp Forwards'

Integration of Meta AI into WhatsApp raises significant privacy concerns regarding user data.

fromHackernoon

Reddit vs. Anthropic: The Lawsuit That Could Put a Price on Your Online Conversations | HackerNoon

Reddit has charged Anthropic with training its AI, Claude, on user posts without consent, highlighting a shift towards monetizing access to digital content.

Privacy professionals

#cloudflare

9 months ago

Privacy professionals

An AI data trap catches Perplexity impersonating Google

fromHackernoon

4 years ago

Artificial intelligence

Cloudflare's AI Labyrinth Bankrupts Data Scrapers | HackerNoon

9 months ago

Privacy professionals

An AI data trap catches Perplexity impersonating Google

fromHackernoon

4 years ago

Artificial intelligence

Cloudflare's AI Labyrinth Bankrupts Data Scrapers | HackerNoon

Perplexity AI crawlers accused of stealth data scraping

Perplexity AI search startup is allegedly disguising its content-scraping bots to ignore website restrictions.

fromDigiday

10 months ago

Media Briefing: AI payouts may be entering a new era

AI compensation models are shifting from flat-fee licensing to varied systems that compensate publishers based on multiple functions of data usage.

Media industry

10 months ago

Pay up or stop scraping: Cloudflare program charges bots for each crawl

Without ongoing contributions from content creators, AI systems risk becoming outdated, biased, or less reliable-ultimately diminishing user trust and the value of AI products.

Tech industry

Marketing tech

10 months ago

Nextdoor CEO on why he won't cut licensing deals with AI companies

Data scraping by AI provides advantages over content owners, compelling Nextdoor to block such practices.

Privacy professionals

fromDatabreaches

Researchers Scrape 2 Billion Discord Messages and Publish Them Online

A massive database of over 2 billion Discord messages has raised privacy concerns among users and moderators.

Artificial intelligence