An Improved LSTM for Language Identification

Qingran Zhan, Liqiang Zhang, Hui Deng, Xiang Xie

科研成果: 书/报告/会议事项章节会议稿件同行评审

5 引用 (Scopus)

摘要

In this paper, we propose a novel framework by combining the phonetic temporal neural model (PTN) with an improved LSTM (IM-LSTM). This is achieved by using an up-down connection from the time t to t+1 in the LSTM structure, which aims to capture the latent information from the previous time step. This updated structure can perform better to discriminate the frame-level phonetic information produced by PTN. On the AP16-OLR language identification dataset, our final model achieves relative growth rate 5.04%, 2.19%, 2.73% on EER and 6.55%, 5.81%, 2.23% on Cavg in 1s, 3s and full-length utterance condition than the standard PTN, respectively. The proposed framework receives a better performance than the standard PTN and other proposed models, particularly in 1s condition. This shows the efficacy and flexibility of the proposed method.

源语言英语
主期刊名2018 14th IEEE International Conference on Signal Processing Proceedings, ICSP 2018
编辑Yuan Baozong, Ruan Qiuqi, Zhao Yao, An Gaoyun
出版商Institute of Electrical and Electronics Engineers Inc.
609-612
页数4
ISBN(电子版)9781538646724
DOI
出版状态已出版 - 2 2月 2019
活动14th IEEE International Conference on Signal Processing, ICSP 2018 - Beijing, 中国
期限: 12 8月 201816 8月 2018

出版系列

姓名International Conference on Signal Processing Proceedings, ICSP
2018-August

会议

会议14th IEEE International Conference on Signal Processing, ICSP 2018
国家/地区中国
Beijing
时期12/08/1816/08/18

指纹

探究 'An Improved LSTM for Language Identification' 的科研主题。它们共同构成独一无二的指纹。

引用此