Inside the trap Reddit set for AI startup Perplexity to test whether it was stealing data
Briefly

Inside the trap Reddit set for AI startup Perplexity to test whether it was stealing data
"But, the lawsuit said, Perplexity continued to cite Reddit in its AI-generated answers - more than ever. The CEO of another AI company even speculated that Perplexity and Reddit secretly struck a content licensing deal. "The increase was so dramatic that an outside observer hypothesized that the increase was due to Perplexity entering a licensing deal with Reddit and thereby obtaining full access to Reddit's data," Reddit's lawsuit said."
""In truth, there is no license between Perplexity and Reddit," the lawsuit said, adding that it was the result of "a scheme by Perplexity to obtain Reddit's data through the circumvention of the technological measures protecting Reddit data." So Reddit set a trap. The company created a test post that could only be crawled by Google's search engine, according to the lawsuit. While Google has a content-licensing deal with Reddit, Perplexity does not."
"It was the digital equivalent of a "marked bill," Reddit's lawsuit said. According to the lawsuit, the only way Perplexity would be able to get the data in the test post is if it bypassed Reddit's guardrails using Google's search engine page results, or SERPS. If the content from the post was ingested by Perplexity through Google, Reddit would know, according to the lawsuit."
Reddit sued Perplexity and data scrapers, alleging they illegally acquired Reddit content by circumventing technical protections. Perplexity had agreed to block scraping but continued to cite Reddit heavily, prompting speculation about a licensing deal; Reddit denies any license. Reddit created a test post only crawlable by Google's search engine; Google has a content-licensing pact with Reddit while Perplexity does not. Reddit characterized the test post as a digital 'marked bill' and said the only way Perplexity could access it was by bypassing Reddit's guardrails using Google's search engine result pages. A few hours after setting the trap, Reddit obtained confirming evidence.
Read at Business Insider
Unable to calculate read time
[
|
]