Low quality papers are flooding the cancer literature - can this AI tool help to catch them?
Briefly

Low quality papers are flooding the cancer literature - can this AI tool help to catch them?
"An artificial intelligence (AI) tool that scans manuscript titles and abstracts has flagged more than 250,000 cancer studies that bear textual similarities to articles that are known to have been produced by paper mills. These businesses produce fake or low-quality research papers and sell authorships. Articles produced by paper mills often include fabricated data, duplicated images and weird phrases, which are strange wording choices used to evade plagiarism detectors."
"But, paper mills probably rely on boilerplate templates to mass produce papers, says Adrian Barnett, a statistician at Queensland University of Technology in Brisbane, Australia, which could be detected by large language models (LLMs) that analyse patterns in texts. Barnett and his colleagues developed a model and posted their analysis on the preprint server bioRxiv last month. It has not yet been peer reviewed. They emphasize that their findings should be checked by human specialists and are not confirmed cases of research fraud."
An AI tool scanning manuscript titles and abstracts flagged more than 250,000 cancer studies with textual similarities to articles known to be produced by paper mills. Paper mills produce fake or low-quality research papers, sell authorships, and often include fabricated data, duplicated images and odd wording to evade plagiarism detectors. Detection by human integrity specialists is possible but time-consuming and often cannot conclusively prove paper-mill involvement. Barnett and colleagues trained a BERT model to distinguish genuine cancer studies from retracted papers linked to suspected paper-mill activity, and made their analysis available on bioRxiv. The model's results require human verification and are not confirmed cases of fraud.
Read at Nature
Unable to calculate read time
[
|
]