Efficient Vocal Cord Lesion Recognition by Combing Yolov7 and Attention Module

Yanda Wu, Yuqing He, Dongyan Huang, Yang Liu, Jingxuan Zhu*, Hengli Zhang*

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Currently, vocal cord lesion diagnosis of laryngoscopic images mainly relies on physicians' expertise and clinical experience. This greatly increases the work pressure of physicians and has limited efficiency. To solve the above problems, this study aims to construct a deep network structure named VCLR-Net based on the improved YOLOv7 to achieve the detection and recognition of vocal cord lesions. First, Convolutional Block Attention Modules (CBAM) are added to the HEAD network to improve the focus of color and spatial features on lesions. Next, the Alpha Intersection over Union loss (AlphaIOU) loss function is used to improve the robustness of the lesion recognition model. In the experimental results, the proposed VCLR-Net network achieves mAP and F1 of 0.762 and 0.748 in the image dataset. The network enables accurate lesion recognition for a large number of laryngoscopic images.

源语言英语
主期刊名AOPC 2023
主期刊副标题Optical Sensing, Imaging, and Display Technology and Applications; and Biomedical Optics
编辑Yadong Jiang, Xiaoyong Wang, Dong Liu, Bin Xue, Yongtian Wang, Liangcai Cao, Qiong-Hua Wang, Chao-Yang Lu
出版商SPIE
ISBN(电子版)9781510672321
DOI
出版状态已出版 - 2023
活动2023 Applied Optics and Photonics China: Optical Sensing, Imaging, and Display Technology and Applications; and Biomedical Optics, AOPC 2023 - Beijing, 中国
期限: 25 7月 202327 7月 2023

出版系列

姓名Proceedings of SPIE - The International Society for Optical Engineering
12963
ISSN(印刷版)0277-786X
ISSN(电子版)1996-756X

会议

会议2023 Applied Optics and Photonics China: Optical Sensing, Imaging, and Display Technology and Applications; and Biomedical Optics, AOPC 2023
国家/地区中国
Beijing
时期25/07/2327/07/23

引用此