A Target Speaker Separation Neural Network with Joint-Training

Wenjing Yang, Jing Wang, Hongfeng Li, Na Xu, Fei Xiang, Kai Qian, Shenghua Hu

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Peer-reviewed

2 Citations (Scopus)

Abstract

Target speaker separation aims to extract a target speech signal from a mixture of multiple interfering voices. It is promising for addressing conventional difficulties in speech separation, such as arbitrary source permutation and an unknown number of sources, and is useful for personal applications such as online meetings and personal phone calls. Recently, deep-learning-based models have provided more alternatives for target speaker separation tasks. In this paper, we propose a target speaker separation neural network with joint training that separates the target voice in the spectrogram domain using the proposed combinative loss function. Experimental results show that, compared with the baseline, the proposed method yields better performance on both test data and real data, and the proposed combinative loss function proves more effective for this task.
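
The abstract does not specify the network architecture or the exact terms of the combinative loss. The following is a minimal sketch, assuming a spectrogram-domain masking network conditioned on a target speaker embedding, and assuming the combinative loss mixes a magnitude-spectrogram MSE with a time-domain SI-SNR term; all module names, dimensions, and the loss weighting are illustrative, not the paper's method.

```python
# Hypothetical sketch of spectrogram-domain target speaker separation with a
# combined loss. The mask estimator, the speaker-embedding conditioning, and
# the MSE + SI-SNR combination are assumptions, not the paper's exact design.
import torch
import torch.nn as nn
import torch.nn.functional as F


class MaskEstimator(nn.Module):
    """Toy mask network: predicts a magnitude mask for the target speaker,
    conditioned on a fixed-length speaker embedding."""

    def __init__(self, n_freq=257, emb_dim=128, hidden=256):
        super().__init__()
        self.rnn = nn.GRU(n_freq + emb_dim, hidden, batch_first=True)
        self.out = nn.Linear(hidden, n_freq)

    def forward(self, mix_mag, spk_emb):
        # mix_mag: (B, T, F) mixture magnitude; spk_emb: (B, emb_dim)
        emb = spk_emb.unsqueeze(1).expand(-1, mix_mag.size(1), -1)
        h, _ = self.rnn(torch.cat([mix_mag, emb], dim=-1))
        return torch.sigmoid(self.out(h))  # mask in [0, 1], applied to mix_mag


def si_snr_loss(est, ref, eps=1e-8):
    """Negative scale-invariant SNR between time-domain signals of shape (B, N)."""
    ref = ref - ref.mean(dim=-1, keepdim=True)
    est = est - est.mean(dim=-1, keepdim=True)
    proj = (est * ref).sum(-1, keepdim=True) * ref / (ref.pow(2).sum(-1, keepdim=True) + eps)
    noise = est - proj
    ratio = proj.pow(2).sum(-1) / (noise.pow(2).sum(-1) + eps)
    return -10 * torch.log10(ratio + eps).mean()


def combined_loss(est_mag, tgt_mag, est_wav, tgt_wav, alpha=0.5):
    """Hypothetical combinative loss: spectrogram-domain MSE plus time-domain
    SI-SNR on the waveform reconstructed (e.g. via iSTFT) from the masked
    spectrogram. The weight alpha is an assumed hyperparameter."""
    return alpha * F.mse_loss(est_mag, tgt_mag) + (1 - alpha) * si_snr_loss(est_wav, tgt_wav)
```

Under joint training, the mask estimator (and, in the paper's setting, the accompanying speaker-related module) would be optimized together against such a combined objective rather than against a single-domain loss.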

Original language: English
Title of host publication: 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 614-618
Number of pages: 5
ISBN (electronic): 9789881476890
Publication status: Published - 2021
Event: 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 - Tokyo, Japan
Duration: 14 Dec 2021 → 17 Dec 2021

Publication series

Name: 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 - Proceedings

Conference

Conference: 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021
Country/Territory: Japan
City: Tokyo
Period: 14/12/21 → 17/12/21
