Low-quality papers are surging by exploiting public data sets and AI

Last year, Matt Spick began to notice oddly similar papers flooding in for peer review at Scientific Reports, where he is an associate editor. He smelled a rat.

The papers all drew on a publicly available U.S. data set: the National Health and Nutrition Examination Survey (NHANES), which through health exams, blood tests, and interviews has collected dietary information and other health-related measurements from more than 130,000 people. “I was getting so many nearly identical papers—one a day, sometimes even two a day,” says Spick, a statistician at the University of Surrey.

What he was seeing at his one journal is part of a larger problem, Spick has discovered. In recent years, there has been a drastic surge in poor-quality papers using NHANES, possibly spearheaded by illicit moneymaking enterprises known as paper mills and facilitated by the use of artificial intelligence (AI)-generated text, he and colleagues reported in PLOS Biology last week. The finding suggests large public health data sets are ripe for exploitation, they say.

Such free data sources allow almost anyone to take a known research method and swap in new variables to create fresh “findings” in a kind of “research Mad Libs,” says Reese Richardson, a metascientist at Northwestern University who was not involved with the work. Other researchers have found similar “explosions” in a range of topics, he says, including various kinds of genetic studies as well as analyses of bibliometrics or gender disparities in different scientific disciplines.

The NHANES papers Spick was receiving all followed the same formula: They chose a health condition, an environmental or physiological factor that could be associated with it, and a population group—perhaps looking at the link between vitamin D levels and depression in men over age 65, or poor dental health and diabetes in women between the ages of 18 and 45. “It felt like every possible combination was being worked through by someone,” Spick says.

To get a better understanding of how prevalent these studies are, he and his team searched two major databases of scientific papers, PubMed and Scopus, for studies using NHANES data that looked at single associations. They found 341 of these papers published in 147 journals, including Scientific ReportsBMC Public Health, and BMJ Open. Between 2014 and 2021, an average of four such papers were published per year—but a rapid increase kicked off in 2022, with 190 papers published in 2024 up to October, when the researchers did their search. The rise far outstripped the growth in health studies using large data sets generally, the authors report, suggesting some additional factor underlying the swell of NHANES studies.

The timing points to the widespread availability of AI chatbots such as ChatGPT that can generate readable text from simple prompts and uploaded information. They may have been used to rephrase the same basic NHANES findings endlessly to avoid plagiarism detection, says Jennifer Byrne, a molecular biologist at the University of Sydney who peer reviewed the PLOS Biology paper. It’s not possible to conclude with certainty that paper mills—commercial entities that sell authorship on fraudulent or low-quality papers—produced the papers, she says, but the “timing and scale of the increase make you think there has to be some kind of coordination behind this.”

Spick and his co-authors also found that the majority of recent NHANES papers were authored by researchers in China: Ninety-two percent of the papers published after 2021 had a first author affiliated with a Chinese institution, compared with just 8% of papers published before 2021. This also suggests paper mill involvement, Spick says, pointing to findings that the pressures and incentives facing researchers in China drive the use of paper mills.

Many of the more recent NHANES studies selectively analyzed portions of its data set without a clear rationale—for example, authors limited their analysis to certain years, or certain ages of people in the survey. That suggests the authors were on the hunt for statistically significant results to generate easy publications, Spick says. But fishing for results in such a huge data set is bound to come up with many false positive findings. When the team took a closer look at the 28 NHANES studies that had explored depression, they found that only 13 of the results survived a statistical adjustment that corrects for the risk of finding false positives.

Tim Kersjes, head of research integrity at Springer Nature—which owns Scientific ReportsBMC Public Health, and many of the other journals that published papers in the analysis—said the publisher had already retracted a number of NHANES papers, with investigations ongoing. “We have also been raising awareness of these concerns within our editorial community,” he said, with editors asked to remain vigilant and assess papers before sending them out for peer review. A spokesperson for the BMJ Group said it shares concerns about paper mills and AI misuse, takes allegations of misconduct seriously, and will investigate.

Spick and his team think their analysis may drastically underestimate the problem. Their search only looked for NHANES studies that fit the formula Spick had been seeing, but a broader search finds that papers using the data set increased from 4926 in 2023 to 7876 in 2024. In just a quick search, Richardson found five NHANES papers that appear to not be included in Spick’s team’s study but are associated with a suspected paper mill. Similar to those identified by Spick and his colleagues, these papers explore simple associations in the NHANES data set—for example, between electronic cigarette use and lung disease. He also noticed similar papers that use a different data set, the National Inpatient Sample, which gathers data on hospital stays from millions of patients.

Other big health data sets—such as the Global Burden of Disease study—may also be vulnerable, Spick says. These data sets make it easy for researchers to interact with their information using coding languages such as Python or R, but this also makes them easy to exploit: His team was easily able to write code that could pull all the data from NHANES and “chug through the combinations” of diseases and health variables. The “industrialization” of low-quality research overwhelms the literature with useless findings, Spick says. “Honestly, I got really hopping mad about it.”

These papers reflect broad problems in both scientific publishing and how research is rewarded, Richardson says. “All of the publishers named in the article accepted fees, likely on the order of $1000 each, to publish this junk,” he notes. (Open-access journals, including PLOS Biology, generally charge author fees to make papers freely available.) And researchers are incentivized to publish more papers, rather than higher quality papers, in order to advance in their careers, Richardson adds. The problem, he warns, “will only get worse unless we radically restructure incentives around scientific publication.”

Update, 20 May, 5:15 p.m.: The story has been updated with responses from Springer Nature and the BMJ Group, as well as further comments from Spick and Richardson.

See more here Science.org

Please Donate Below To Support Our Ongoing Work To Defend The Scientific Method

PRINCIPIA SCIENTIFIC INTERNATIONAL, legally registered in the UK as a company incorporated for charitable purposes. Head Office: 27 Old Gloucester Street, London WC1N 3AX. 

Trackback from your site.

Comments (2)

  • Avatar

    Tom

    |

    For the medical mafia, they are the highest quality since they are 100% lies and poppycock.

    Reply

  • Avatar

    Jerry Krause

    |

    Hi Tom and PSI Readers,

    “says Reese Richardson, a metascientist at Northwestern University who was not involved with the work.”

    What is a metascientis, or metascience? “Metascience, also known as meta-research or the science of science, is the study of science itself, using scientific methods to improve the quality and efficiency of research practices. It essentially applies scientific inquiry to the scientific process, aiming to understand how science is done and how it can be done better.” (AI)

    Just thought a PSI reader should know. Have a good day

    Reply

Leave a comment

Save my name, email, and website in this browser for the next time I comment.
Share via
Share via