Generative AI versus human across multiple systematic review tasks: the regenAIrate project
The rapid emergence of generative artificial intelligence (genAI) is reshaping the methodological landscape of systematic reviews and meta-analyses. AI tools are now being applied across nearly all phases of the systematic review process; however, their performance varies substantially. Persistent challenges, including confabulation, non-deterministic outputs, and opaque “black-box” algorithms, raise concerns about transparency, reproducibility, and methodological rigor. There is therefore a clear need for systematic evaluation of genAI tools in comparison with human reviewers across all stages of the systematic review lifecycle.
The regenAIrate project (Systematic review of generative AI in systematic reviews of the biomedical literature) aims to identify, evaluate, and synthesise the current empirical evidence on the use of genAI throughout the systematic review workflow, including formulation of the PICO question, literature searching, screening, data extraction, risk-of-bias assessment, evidence synthesis, and grading. Particular emphasis is placed on task-specific performance, methodological limitations, and implications for evidence synthesis methods.
To prepare this project, we carried out an exploratory literature mapping exercise to identify genAI tools used at different steps of the systematic review process and to obtain an initial overview of the available evidence. This work was done as a rapid, single-researcher assessment. It provided useful insights into the field and helped us refine the research question, define the inclusion criteria, and develop the search strategy for the planned living systematic review.
The primary output of regenAIrate will be a living guidance document, updated regularly. All data and analyses will be made publicly available. A central component of the project is building a network of Swiss-based researchers with an interest in AI for systematic reviews and fostering collaboration with international experts.