Speaker recognition based on lightweight neural network for smart home solutions

Haojun Ai; Wuyang Xia; Quanxin Zhang

doi:10.1007/978-3-030-37352-8_37

Speaker recognition based on lightweight neural network for smart home solutions

Haojun Ai^*, Wuyang Xia, Quanxin Zhang

^*Corresponding author for this work

School of Computer Science and Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Citation (Scopus)

Abstract

With the technological advancement of smart home devices, the lifestyles of people have been gradually changed. Meanwhile, speaker recognition is available in almost all smart home devices. Currently, the mainstream speaker recognition service is provided by a very deep neural network which trained on the cloud server. However, these deep neural networks are not suitable for deployment and operation on smart home devices. Aiming at this problem, in this paper, we propose a packet bottleneck method to improve SqueezeNet which has been widely used in the speaker recognition task. In the meantime, a lightweight structure named TrimNet has been designed. Besides, a model updating strategy based on transfer learning has been adopted to avoid model deteriorates due to the cold speech. The experimental results demonstrate that the proposed lightweight structure TrimNet is superior to SqueezeNet in classification accuracy, structural parameter quantity, and calculation amount. Moreover, the model updating method can increase the recognition rate of cold speech without damaging the recognition rate of other speakers.

Original language	English
Title of host publication	Cyberspace Safety and Security - 11th International Symposium, CSS 2019, Proceedings
Editors	Jaideep Vaidya, Xiao Zhang, Jin Li
Publisher	Springer
Pages	421-431
Number of pages	11
ISBN (Print)	9783030373511
DOIs	https://doi.org/10.1007/978-3-030-37352-8_37
Publication status	Published - 2019
Event	11th International Symposium on Cyberspace Safety and Security, CSS 2019 - Guangzhou, China Duration: 1 Dec 2019 → 3 Dec 2019

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	11983 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	11th International Symposium on Cyberspace Safety and Security, CSS 2019
Country/Territory	China
City	Guangzhou
Period	1/12/19 → 3/12/19

Keywords

Smart home
Speaker recognition
Transfer learning

Access to Document

10.1007/978-3-030-37352-8_37

Cite this

Ai, H., Xia, W., & Zhang, Q. (2019). Speaker recognition based on lightweight neural network for smart home solutions. In J. Vaidya, X. Zhang, & J. Li (Eds.), Cyberspace Safety and Security - 11th International Symposium, CSS 2019, Proceedings (pp. 421-431). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11983 LNCS). Springer. https://doi.org/10.1007/978-3-030-37352-8_37

Ai, Haojun ; Xia, Wuyang ; Zhang, Quanxin. / Speaker recognition based on lightweight neural network for smart home solutions. Cyberspace Safety and Security - 11th International Symposium, CSS 2019, Proceedings. editor / Jaideep Vaidya ; Xiao Zhang ; Jin Li. Springer, 2019. pp. 421-431 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{5ef13ec227634d2e89380ea186e055df,

title = "Speaker recognition based on lightweight neural network for smart home solutions",

abstract = "With the technological advancement of smart home devices, the lifestyles of people have been gradually changed. Meanwhile, speaker recognition is available in almost all smart home devices. Currently, the mainstream speaker recognition service is provided by a very deep neural network which trained on the cloud server. However, these deep neural networks are not suitable for deployment and operation on smart home devices. Aiming at this problem, in this paper, we propose a packet bottleneck method to improve SqueezeNet which has been widely used in the speaker recognition task. In the meantime, a lightweight structure named TrimNet has been designed. Besides, a model updating strategy based on transfer learning has been adopted to avoid model deteriorates due to the cold speech. The experimental results demonstrate that the proposed lightweight structure TrimNet is superior to SqueezeNet in classification accuracy, structural parameter quantity, and calculation amount. Moreover, the model updating method can increase the recognition rate of cold speech without damaging the recognition rate of other speakers.",

keywords = "Smart home, Speaker recognition, Transfer learning",

author = "Haojun Ai and Wuyang Xia and Quanxin Zhang",

note = "Publisher Copyright: {\textcopyright} 2019, Springer Nature Switzerland AG.; 11th International Symposium on Cyberspace Safety and Security, CSS 2019 ; Conference date: 01-12-2019 Through 03-12-2019",

year = "2019",

doi = "10.1007/978-3-030-37352-8_37",

language = "English",

isbn = "9783030373511",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer",

pages = "421--431",

editor = "Jaideep Vaidya and Xiao Zhang and Jin Li",

booktitle = "Cyberspace Safety and Security - 11th International Symposium, CSS 2019, Proceedings",

address = "Germany",

}

Ai, H, Xia, W & Zhang, Q 2019, Speaker recognition based on lightweight neural network for smart home solutions. in J Vaidya, X Zhang & J Li (eds), Cyberspace Safety and Security - 11th International Symposium, CSS 2019, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11983 LNCS, Springer, pp. 421-431, 11th International Symposium on Cyberspace Safety and Security, CSS 2019, Guangzhou, China, 1/12/19. https://doi.org/10.1007/978-3-030-37352-8_37

Speaker recognition based on lightweight neural network for smart home solutions. / Ai, Haojun; Xia, Wuyang; Zhang, Quanxin.
Cyberspace Safety and Security - 11th International Symposium, CSS 2019, Proceedings. ed. / Jaideep Vaidya; Xiao Zhang; Jin Li. Springer, 2019. p. 421-431 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11983 LNCS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Speaker recognition based on lightweight neural network for smart home solutions

AU - Ai, Haojun

AU - Xia, Wuyang

AU - Zhang, Quanxin

PY - 2019

Y1 - 2019

N2 - With the technological advancement of smart home devices, the lifestyles of people have been gradually changed. Meanwhile, speaker recognition is available in almost all smart home devices. Currently, the mainstream speaker recognition service is provided by a very deep neural network which trained on the cloud server. However, these deep neural networks are not suitable for deployment and operation on smart home devices. Aiming at this problem, in this paper, we propose a packet bottleneck method to improve SqueezeNet which has been widely used in the speaker recognition task. In the meantime, a lightweight structure named TrimNet has been designed. Besides, a model updating strategy based on transfer learning has been adopted to avoid model deteriorates due to the cold speech. The experimental results demonstrate that the proposed lightweight structure TrimNet is superior to SqueezeNet in classification accuracy, structural parameter quantity, and calculation amount. Moreover, the model updating method can increase the recognition rate of cold speech without damaging the recognition rate of other speakers.

AB - With the technological advancement of smart home devices, the lifestyles of people have been gradually changed. Meanwhile, speaker recognition is available in almost all smart home devices. Currently, the mainstream speaker recognition service is provided by a very deep neural network which trained on the cloud server. However, these deep neural networks are not suitable for deployment and operation on smart home devices. Aiming at this problem, in this paper, we propose a packet bottleneck method to improve SqueezeNet which has been widely used in the speaker recognition task. In the meantime, a lightweight structure named TrimNet has been designed. Besides, a model updating strategy based on transfer learning has been adopted to avoid model deteriorates due to the cold speech. The experimental results demonstrate that the proposed lightweight structure TrimNet is superior to SqueezeNet in classification accuracy, structural parameter quantity, and calculation amount. Moreover, the model updating method can increase the recognition rate of cold speech without damaging the recognition rate of other speakers.

KW - Smart home

KW - Speaker recognition

KW - Transfer learning

UR - http://www.scopus.com/inward/record.url?scp=85078518152&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-37352-8_37

DO - 10.1007/978-3-030-37352-8_37

M3 - Conference contribution

AN - SCOPUS:85078518152

SN - 9783030373511

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 421

EP - 431

BT - Cyberspace Safety and Security - 11th International Symposium, CSS 2019, Proceedings

A2 - Vaidya, Jaideep

A2 - Zhang, Xiao

A2 - Li, Jin

PB - Springer

T2 - 11th International Symposium on Cyberspace Safety and Security, CSS 2019

Y2 - 1 December 2019 through 3 December 2019

ER -

Ai H, Xia W, Zhang Q. Speaker recognition based on lightweight neural network for smart home solutions. In Vaidya J, Zhang X, Li J, editors, Cyberspace Safety and Security - 11th International Symposium, CSS 2019, Proceedings. Springer. 2019. p. 421-431. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-030-37352-8_37

Speaker recognition based on lightweight neural network for smart home solutions

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this