Beware of AI 'model collapse': How training on synthetic data pollutes the next generation
Briefly

Oxford University scholars warn that training generative AI on synthetic data can drastically degrade model accuracy, potentially rendering models useless.
Model collapse, the degenerative process described by Ilia Shumailov's team, occurs when generative models produce data that pollutes the training sets of subsequent models, causing them to misperceive reality.
Over successive generations, models trained on synthetic data first lose track of less-common facts, then grow generic, and eventually produce irrelevant outputs that degrade into gibberish.
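
The dynamic can be seen in a toy simulation (a minimal sketch of the general idea, not the team's experiment; the Gaussian family, sample size, and generation count here are illustrative choices): fit a one-dimensional Gaussian to a finite sample, then train each new "generation" only on samples drawn from the previous generation's fit. Finite samples underrepresent rare tail events, so the fitted spread drifts toward zero.

```python
import numpy as np

rng = np.random.default_rng(0)

# Generation 0 trains on "real" data: a standard normal distribution.
data = rng.normal(loc=0.0, scale=1.0, size=100)

for gen in range(31):
    # "Train" a trivial generative model: fit a Gaussian by mean and std.
    mu, sigma = data.mean(), data.std()
    if gen % 5 == 0:
        print(f"generation {gen:2d}: mu={mu:+.3f}, sigma={sigma:.3f}")
    # The next generation never sees real data, only the model's output.
    # Finite samples underrepresent rare (tail) events, so sigma tends to
    # shrink generation after generation and the tails disappear.
    data = rng.normal(loc=mu, scale=sigma, size=100)
```

With these toy parameters, sigma typically drifts well below 1.0 within a few dozen generations; the rare events encoded in the tails are the first casualties, mirroring the "less-common facts" the models forget.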
Read at ZDNET