Prior Guided Transformer for Accurate Radiology Reports Generation

Bin Yan, Mingtao Pei, Meng Zhao*, Caifeng Shan*, Zhaoxing Tian

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

11 Citations (Scopus)

Abstract

In this paper, we propose a prior guided transformer for accurate radiology reports generation. In the encoder part, a radiograph is firstly represented by a set of patch features, which is obtained through a convolutional neural network and a traditional transformer encoder. Then an Additive Gaussian model is applied to represent the prior knowledge based on unsupervised clustering and sparse attention. In the decoder part, prior embeddings are acquired by probabilistically sampling from the radiograph prior. Then the visual features, language embeddings, and prior embeddings are fused by our proposed Prior Guided Attention to generate accurate radiology reports. Experiment results show that our method achieves better performance than state-of-the-art methods on two public radiology datasets, which proves the effectiveness of our prior guided transformer.

Original languageEnglish
Pages (from-to)5631-5640
Number of pages10
JournalIEEE Journal of Biomedical and Health Informatics
Volume26
Issue number11
DOIs
Publication statusPublished - 1 Nov 2022

Keywords

  • Transformer
  • prior knowledge
  • radiology reports generation
  • sparse attention

Fingerprint

Dive into the research topics of 'Prior Guided Transformer for Accurate Radiology Reports Generation'. Together they form a unique fingerprint.

Cite this