TY - GEN
T1 - Inspiration of Prototype Knowledge
T2 - 2024 IEEE International Conference on E-Health Networking, Application and Services, HealthCom 2024
AU - Wu, Qingrong
AU - Sun, Mengkai
AU - Zhang, Haojie
AU - Zhang, Yongxin
AU - Liu, Yiming
AU - Meng, Boyang
AU - Qian, Kun
AU - Hu, Bin
AU - Nakamura, Toru
AU - Nomura, Taishin
AU - Schuller, Björn W.
AU - Yamamoto, Yoshiharu
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Cardiovascular diseases (CVDs) stand as the primary reason of fatalities globally, especially in low- and middle-income countries. In recent years, with the leverage of computer audition technologies, the diagnosis of CVDs through heart sounds become a popular topic. Current models and techniques are trained, validated, and tested on the same dataset, which need to be retrained when encountering new data. To make the best use of sparse data, we propose a Prototypical Network framework with heuristic weight for heart sound recognition. After extracting two different features (Mel Spectrogram and Mel Frequency Cepstral Coefficients) and encoding the features, we calculate the distance between two categories (normal and abnormal), then, a heuristic weight is assigned to the distance that makes the blurred boundaries more distinct. By considering the subject independence, the Unweighted Average Recall (UAR) on the PhysioNet/CinC Challenge 2016 is 68.2 % and 67.7 % on two features, respectively. The capability of our model to work on different datasets is proved by a UAR of 66.4 %, which exceeds the baseline UAR of 58.6 % under a single model.
AB - Cardiovascular diseases (CVDs) stand as the primary reason of fatalities globally, especially in low- and middle-income countries. In recent years, with the leverage of computer audition technologies, the diagnosis of CVDs through heart sounds become a popular topic. Current models and techniques are trained, validated, and tested on the same dataset, which need to be retrained when encountering new data. To make the best use of sparse data, we propose a Prototypical Network framework with heuristic weight for heart sound recognition. After extracting two different features (Mel Spectrogram and Mel Frequency Cepstral Coefficients) and encoding the features, we calculate the distance between two categories (normal and abnormal), then, a heuristic weight is assigned to the distance that makes the blurred boundaries more distinct. By considering the subject independence, the Unweighted Average Recall (UAR) on the PhysioNet/CinC Challenge 2016 is 68.2 % and 67.7 % on two features, respectively. The capability of our model to work on different datasets is proved by a UAR of 66.4 %, which exceeds the baseline UAR of 58.6 % under a single model.
KW - computer audition
KW - heart sound classification
KW - meta-learning
KW - model adaption
KW - prototypical network
UR - http://www.scopus.com/inward/record.url?scp=85219588108&partnerID=8YFLogxK
U2 - 10.1109/HEALTHCOM60970.2024.10880804
DO - 10.1109/HEALTHCOM60970.2024.10880804
M3 - Conference contribution
AN - SCOPUS:85219588108
T3 - 2024 IEEE International Conference on E-Health Networking, Application and Services, HealthCom 2024
BT - 2024 IEEE International Conference on E-Health Networking, Application and Services, HealthCom 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 18 November 2024 through 20 November 2024
ER -