An improvement of the degradation of speaker recognition in continuous cold speech for home assistant

Haojun Ai, Yifeng Wang, Yuhong Yang*, Quanxin Zhang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

4 引用 (Scopus)

摘要

Home assistant with speech user interfaces is quite welcomed due to its convenience in recent years. With speaker recognition (SR) technology in this application, personalized services (e.g., playing music, making to-do lists) for different family members become reality. However, the SR accuracy may decline sharply when a family has a cold due to the restriction of hardware and response time. In this paper, we propose a dual model updating strategy based on cold detection to maintain all speaker voice models. In this method, time domain and frequency domain features would be combined to detect continuous cold speech. And then, corresponding models would be selected to determine the identity according to the results of the detection. In order to continuously track SR performance based on data of mobile phone usage, a new mobile phone-based speech dataset (PBSD) which contains voice, phone model, and user’s state of physical wellness has been constructed. Besides, the relationship between SR accuracy and users’ state of physical wellness also has been analyzed based on a GMM-UBM framework. Finally, to evaluate performance of the proposed method, experiments focused on SR accuracy of 10 speakers from both cold-suffering and healthy states have been conducted. The results demonstrated that the SR accuracy can be improved effectively by the cold detection-based model updating strategy, especially in a cold-suffering circumstance.

源语言英语
主期刊名Cyberspace Safety and Security - 11th International Symposium, CSS 2019, Proceedings
编辑Jaideep Vaidya, Xiao Zhang, Jin Li
出版商Springer
363-373
页数11
ISBN(印刷版)9783030373368
DOI
出版状态已出版 - 2019
活动11th International Symposium on Cyberspace Safety and Security, CSS 2019 - Guangzhou, 中国
期限: 1 12月 20193 12月 2019

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
11982 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议11th International Symposium on Cyberspace Safety and Security, CSS 2019
国家/地区中国
Guangzhou
时期1/12/193/12/19

指纹

探究 'An improvement of the degradation of speaker recognition in continuous cold speech for home assistant' 的科研主题。它们共同构成独一无二的指纹。

引用此