#data-curation

[ follow ]
Information security
fromArs Technica
3 days ago

AI models can acquire backdoors from surprisingly few malicious documents

Small numbers of malicious training samples can install simple backdoors in LLMs, but safety fine-tuning and curated datasets can largely mitigate them.
Digital life
fromHackernoon
2 years ago

Crafting Real-World Queries: MS MARCO Web Search's Authentic Data | HackerNoon

MS MARCO Web Search curates real queries from Bing logs for effective AI training, ensuring authenticity and relevance.
Artificial intelligence
fromFuturism
4 months ago

The Tech Industry Said It Was "Impossible" to Create AI Based Entirely on Ethically-Sourced Data, So These Scientists Proved Them Wrong in Spectacular Fashion

A team of researchers successfully trained a large language model using only public domain or openly licensed data, highlighting an ethical approach.
Marketing tech
fromExchangewire
5 months ago

Data Curation: Examining European Expansion

Data curation is becoming essential for European advertisers, enabling targeted ad inventory management in the evolving ad tech ecosystem.
[ Load more ]