AI firms must play fair when they use academic data in training

from Nature 7 months ago

Researchers are concerned that their intellectual property is being used without restriction to train commercial large language models, and they stress the urgent need for clear boundaries on such use.
Nature: https://www.nature.com/articles/d41586-024-02757-z

Debate continues over whether scraping academic papers to train LLMs constitutes copyright infringement or is permitted under exemptions in existing law.

Because large language models rely heavily on data from scientific papers, two needs have come to the forefront: credit for the creators of that work and detailed disclosure of training datasets.

The unclear legal status of using articles and research papers for AI training raises significant intellectual property questions that affect both researchers and tech firms.


#intellectual-property #ai-ethics #large-language-models #copyright #research-data
