Improving Radiology Report Generation with Adaptive Attention

Lin Wang, Jie Chen*

*此作品的通讯作者

科研成果: 书/报告/会议事项章节章节同行评审

1 引用 (Scopus)

摘要

To avoid the tedious and laborious radiology report writing, automatic radiology reports generation has drawn great attention in recent years. As vision to language task, visual features and language features are equally important for radiology report generation. However, previous methods mainly pay attention to generating fluent reports, which neglects the eminent importance of how to better extract and utilize vision information. Keeping this in mind, we propose a novel architecture with a CLIP-based visual extractor and Multi-Head Adaptive Attention (MHAA) module to address the above two issues: through the vision-language pretrained encoders, more sufficient visual information has been explored, then during report generation, MHAA controls the visual information participating in the generation of each word. Experiments conducted on two public datasets demonstrate that our method outperforms state-of-the-art methods on all the metrics.

源语言英语
主期刊名Studies in Computational Intelligence
出版商Springer Science and Business Media Deutschland GmbH
293-305
页数13
DOI
出版状态已出版 - 2023
已对外发布

出版系列

姓名Studies in Computational Intelligence
1060
ISSN(印刷版)1860-949X
ISSN(电子版)1860-9503

指纹

探究 'Improving Radiology Report Generation with Adaptive Attention' 的科研主题。它们共同构成独一无二的指纹。

引用此

Wang, L., & Chen, J. (2023). Improving Radiology Report Generation with Adaptive Attention. 在 Studies in Computational Intelligence (页码 293-305). (Studies in Computational Intelligence; 卷 1060). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-14771-5_21