Efficient Vocal Cord Lesion Recognition by Combing Yolov7 and Attention Module

Yanda Wu, Yuqing He, Dongyan Huang, Yang Liu, Jingxuan Zhu*, Hengli Zhang*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Currently, vocal cord lesion diagnosis of laryngoscopic images mainly relies on physicians' expertise and clinical experience. This greatly increases the work pressure of physicians and has limited efficiency. To solve the above problems, this study aims to construct a deep network structure named VCLR-Net based on the improved YOLOv7 to achieve the detection and recognition of vocal cord lesions. First, Convolutional Block Attention Modules (CBAM) are added to the HEAD network to improve the focus of color and spatial features on lesions. Next, the Alpha Intersection over Union loss (AlphaIOU) loss function is used to improve the robustness of the lesion recognition model. In the experimental results, the proposed VCLR-Net network achieves mAP and F1 of 0.762 and 0.748 in the image dataset. The network enables accurate lesion recognition for a large number of laryngoscopic images.

Original languageEnglish
Title of host publicationAOPC 2023
Subtitle of host publicationOptical Sensing, Imaging, and Display Technology and Applications; and Biomedical Optics
EditorsYadong Jiang, Xiaoyong Wang, Dong Liu, Bin Xue, Yongtian Wang, Liangcai Cao, Qiong-Hua Wang, Chao-Yang Lu
PublisherSPIE
ISBN (Electronic)9781510672321
DOIs
Publication statusPublished - 2023
Event2023 Applied Optics and Photonics China: Optical Sensing, Imaging, and Display Technology and Applications; and Biomedical Optics, AOPC 2023 - Beijing, China
Duration: 25 Jul 202327 Jul 2023

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume12963
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Conference

Conference2023 Applied Optics and Photonics China: Optical Sensing, Imaging, and Display Technology and Applications; and Biomedical Optics, AOPC 2023
Country/TerritoryChina
CityBeijing
Period25/07/2327/07/23

Keywords

  • attention module
  • deep learning
  • improved yolov7
  • laryngoscope image
  • lesion detection

Cite this