Automatic literature screening using the PAJO deep-learning model for clinical practice guidelines

Yucong Lin, Jia Li, Huan Xiao, Lujie Zheng, Ying Xiao, Hong Song, Jingfan Fan, Deqiang Xiao, Danni Ai, Tianyu Fu, Feifei Wang*, Han Lv*, Jian Yang*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Background: Clinical practice guidelines (CPGs) are designed to assist doctors in clinical decision making. High-quality research articles are important for the development of good CPGs. Commonly used manual screening processes are time-consuming and labor-intensive. Artificial intelligence (AI)-based techniques have been widely used to analyze unstructured data, including texts and images. Currently, there are no effective/efficient AI-based systems for screening literature. Therefore, developing an effective method for automatic literature screening can provide significant advantages. Methods: Using advanced AI techniques, we propose the Paper title, Abstract, and Journal (PAJO) model, which treats article screening as a classification problem. For training, articles appearing in the current CPGs are treated as positive samples. The others are treated as negative samples. Then, the features of the texts (e.g., titles and abstracts) and journal characteristics are fully utilized by the PAJO model using the pretrained bidirectional-encoder-representations-from-transformers (BERT) model. The resulting text and journal encoders, along with the attention mechanism, are integrated in the PAJO model to complete the task. Results: We collected 89,940 articles from PubMed to construct a dataset related to neck pain. Extensive experiments show that the PAJO model surpasses the state-of-the-art baseline by 1.91% (F1 score) and 2.25% (area under the receiver operating characteristic curve). Its prediction performance was also evaluated with respect to subject-matter experts, proving that PAJO can successfully screen high-quality articles. Conclusions: The PAJO model provides an effective solution for automatic literature screening. It can screen high-quality articles on neck pain and significantly improve the efficiency of CPG development. The methodology of PAJO can also be easily extended to other diseases for literature screening.

Original languageEnglish
Article number247
JournalBMC Medical Informatics and Decision Making
Volume23
Issue number1
DOIs
Publication statusPublished - Dec 2023

Keywords

  • Clinical practice guideline
  • Curation
  • Deep learning
  • Natural language processing

Fingerprint

Dive into the research topics of 'Automatic literature screening using the PAJO deep-learning model for clinical practice guidelines'. Together they form a unique fingerprint.

Cite this