#ai-training-datasets

[ follow ]
VentureBeat
5 months ago
Artificial intelligence

One of the world's largest AI training datasets is about to get bigger and 'substantially better'

The organization EleutherAI, which created the diverse text corpora Pile, became a target of legal and ethical concerns regarding the use of AI training datasets.
Despite facing lawsuits, EleutherAI is collaborating with multiple organizations to build an updated version of the Pile dataset that is expected to be bigger and 'substantially better'. [ more ]
Futurism
2 weeks ago
Artificial intelligence

AI Is Being Trained on Images of Real Kids Without Consent

AI training datasets contain real children's data without consent, raising serious privacy concerns. [ more ]
[ Load more ]