Speaker recognition based on lightweight neural network for smart home solutions

Haojun Ai*, Wuyang Xia, Quanxin Zhang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

With the technological advancement of smart home devices, the lifestyles of people have been gradually changed. Meanwhile, speaker recognition is available in almost all smart home devices. Currently, the mainstream speaker recognition service is provided by a very deep neural network which trained on the cloud server. However, these deep neural networks are not suitable for deployment and operation on smart home devices. Aiming at this problem, in this paper, we propose a packet bottleneck method to improve SqueezeNet which has been widely used in the speaker recognition task. In the meantime, a lightweight structure named TrimNet has been designed. Besides, a model updating strategy based on transfer learning has been adopted to avoid model deteriorates due to the cold speech. The experimental results demonstrate that the proposed lightweight structure TrimNet is superior to SqueezeNet in classification accuracy, structural parameter quantity, and calculation amount. Moreover, the model updating method can increase the recognition rate of cold speech without damaging the recognition rate of other speakers.

Original languageEnglish
Title of host publicationCyberspace Safety and Security - 11th International Symposium, CSS 2019, Proceedings
EditorsJaideep Vaidya, Xiao Zhang, Jin Li
PublisherSpringer
Pages421-431
Number of pages11
ISBN (Print)9783030373511
DOIs
Publication statusPublished - 2019
Event11th International Symposium on Cyberspace Safety and Security, CSS 2019 - Guangzhou, China
Duration: 1 Dec 20193 Dec 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11983 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference11th International Symposium on Cyberspace Safety and Security, CSS 2019
Country/TerritoryChina
CityGuangzhou
Period1/12/193/12/19

Keywords

  • Smart home
  • Speaker recognition
  • Transfer learning

Fingerprint

Dive into the research topics of 'Speaker recognition based on lightweight neural network for smart home solutions'. Together they form a unique fingerprint.

Cite this