Digital memory at stake: News outlets block Wayback Machine

"The Internet Archive's Wayback Machine contains over 1 billion archived web pages, serving as a vital resource for journalists, researchers, and historians seeking original online content."

"At least 241 news outlets from nine countries are blocking the archive's web crawlers, including major players like the Guardian and the New York Times."

"Media outlets are concerned that AI firms will use their content from the archive to train language models without permission, violating copyright laws."

"A spokesperson for the New York Times stated that their content on the Internet Archive is being used by AI companies in violation of copyright law to directly compete with them."

The Internet Archive, and its Wayback Machine in particular, has long been crucial for accessing archived web content. It now faces a significant challenge: more than 240 media outlets, including prominent organizations such as the New York Times and the Guardian, are blocking its web crawlers. They fear that AI companies will use archived content to train models without permission. Paradoxically, the blocks also hurt the very media outlets that rely on the archive for their own reporting.

#internet-archive #wayback-machine #media-outlets #ai-concerns #copyright-issues

Read at www.dw.com

