TY - JOUR
T1 - Laryngoscope8
T2 - Laryngeal image dataset and classification of laryngeal disease based on attention mechanism
AU - Yin, Li
AU - Liu, Yang
AU - Pei, Mingtao
AU - Li, Jinrang
AU - Wu, Mukun
AU - Jia, Yuanyuan
N1 - Publisher Copyright:
© 2021
PY - 2021/10
Y1 - 2021/10
N2 - Laryngeal disease is a common disease worldwide. However, currently there are no public laryngeal image datasets, which hinders the development of automatic classification of laryngeal disease. In this work, we build a new laryngeal image dataset called Laryngoscope8, which comprises 3057 images of 1950 unique individuals, and the images have been labeled with one of eight labels (including seven pathological labels and one normal label) by professional otolaryngologists. We also propose a laryngeal disease classification method, which uses attention mechanism to obtain the critical area under the supervision of image labels for laryngeal disease classification. That is, we first train a CNN model to classify the laryngeal images. If the classification result is correct, the region with strong response is most likely a critical area. The regions with strong responses are used as training data to train an object localization model that can automatically locate the critical area. Given an image for classification, the trained object localization model is employed to locate the critical area. Then, the located critical area is employed for image classification. The entire process only requires image-level labels and does not require manual labeling of the critical area. Experiment results show that the proposed method achieves promising performance in laryngeal disease classification.
AB - Laryngeal disease is a common disease worldwide. However, currently there are no public laryngeal image datasets, which hinders the development of automatic classification of laryngeal disease. In this work, we build a new laryngeal image dataset called Laryngoscope8, which comprises 3057 images of 1950 unique individuals, and the images have been labeled with one of eight labels (including seven pathological labels and one normal label) by professional otolaryngologists. We also propose a laryngeal disease classification method, which uses attention mechanism to obtain the critical area under the supervision of image labels for laryngeal disease classification. That is, we first train a CNN model to classify the laryngeal images. If the classification result is correct, the region with strong response is most likely a critical area. The regions with strong responses are used as training data to train an object localization model that can automatically locate the critical area. Given an image for classification, the trained object localization model is employed to locate the critical area. Then, the located critical area is employed for image classification. The entire process only requires image-level labels and does not require manual labeling of the critical area. Experiment results show that the proposed method achieves promising performance in laryngeal disease classification.
KW - Attention mechanism
KW - Laryngeal disease classification
KW - Laryngeal image dataset
UR - http://www.scopus.com/inward/record.url?scp=85111599995&partnerID=8YFLogxK
U2 - 10.1016/j.patrec.2021.06.034
DO - 10.1016/j.patrec.2021.06.034
M3 - Article
AN - SCOPUS:85111599995
SN - 0167-8655
VL - 150
SP - 207
EP - 213
JO - Pattern Recognition Letters
JF - Pattern Recognition Letters
ER -