Prior Guided Transformer for Accurate Radiology Reports Generation

Bin Yan, Mingtao Pei, Meng Zhao*, Caifeng Shan*, Zhaoxing Tian

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

11 引用 (Scopus)

摘要

In this paper, we propose a prior guided transformer for accurate radiology reports generation. In the encoder part, a radiograph is firstly represented by a set of patch features, which is obtained through a convolutional neural network and a traditional transformer encoder. Then an Additive Gaussian model is applied to represent the prior knowledge based on unsupervised clustering and sparse attention. In the decoder part, prior embeddings are acquired by probabilistically sampling from the radiograph prior. Then the visual features, language embeddings, and prior embeddings are fused by our proposed Prior Guided Attention to generate accurate radiology reports. Experiment results show that our method achieves better performance than state-of-the-art methods on two public radiology datasets, which proves the effectiveness of our prior guided transformer.

源语言英语
页(从-至)5631-5640
页数10
期刊IEEE Journal of Biomedical and Health Informatics
26
11
DOI
出版状态已出版 - 1 11月 2022

指纹

探究 'Prior Guided Transformer for Accurate Radiology Reports Generation' 的科研主题。它们共同构成独一无二的指纹。

引用此