One million public Bluesky posts scraped for AI training
Briefly

“Our dataset is designed for machine learning research and experimentation with social media data. It aggregates public posts, metadata, and relationships from Bluesky Social.”
“Despite Bluesky's commitment not to use user data for AI training, the open nature of the platform raises concerns for users regarding data privacy and usage.”
“Users didn’t opt-in for their data to be used in this way, highlighting potential issues with consent in open social networks like Bluesky.”
“This event serves as a significant cautionary tale for Bluesky's users, particularly those transitioning from other social media platforms with concerning AI policies.”
Read at Mashable
[
|
]