A Perceptually Motivated Approach for Low-Complexity Speech Semantic Communication

Xiaojiao Chen, Jing Wang, Liang Xu, Jingxuan Huang*, Zesong Fei

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

摘要

Deep learning-based semantic communication is an emerging communication method that achieves cooperative transmission between source and channel. The primary objectives of semantic communication are to enhance the efficiency of information transmission and ensure the accurate restoration of semantic content. Recent studies have shown that semantic communication performs well in enhancing transmission rates, especially in low-signal-to-noise ratio environments. However, existing speech semantic communication methods neglect to account for speech perception at the receiver and the complexity of the method, which limits the practical implementation of semantic communication methods. In this article, we propose a perceptually motivated, low-complexity speech semantic communication method. Specifically, we employ an end-to-end communication approach to transmit the source speech and obtain the reconstructed speech at the receiver. To ensure the accurate extraction of semantic information, we present a low-complexity fully convolutional semantic encoder, which increases the accuracy of semantic information extraction and improves transmission efficiency. Considering the sensitivity of human perception, a multiresolution joint loss function has been implemented to enhance the model's performance and guarantee that the reconstructed speech aligns with the human ear's auditory perception. Experimental results show that the proposed method performs better on objective and subjective metrics than existing speech transmission methods. Compared with existing neural semantic transmission methods, we improve the transmission efficiency, and the number of symbols needed for transmission is decreased by 60% without compromising the quality of speech. Furthermore, the proposed semantic communication method has a lower complexity and consumes less time to transmit.

源语言英语
页(从-至)22054-22065
页数12
期刊IEEE Internet of Things Journal
11
12
DOI
出版状态已出版 - 15 6月 2024

指纹

探究 'A Perceptually Motivated Approach for Low-Complexity Speech Semantic Communication' 的科研主题。它们共同构成独一无二的指纹。

引用此