TY - JOUR
T1 - A Perceptually Motivated Approach for Low-Complexity Speech Semantic Communication
AU - Chen, Xiaojiao
AU - Wang, Jing
AU - Xu, Liang
AU - Huang, Jingxuan
AU - Fei, Zesong
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2024/6/15
Y1 - 2024/6/15
N2 - Deep learning-based semantic communication is an emerging communication method that achieves cooperative transmission between source and channel. The primary objectives of semantic communication are to enhance the efficiency of information transmission and ensure the accurate restoration of semantic content. Recent studies have shown that semantic communication performs well in enhancing transmission rates, especially in low-signal-to-noise ratio environments. However, existing speech semantic communication methods neglect to account for speech perception at the receiver and the complexity of the method, which limits the practical implementation of semantic communication methods. In this article, we propose a perceptually motivated, low-complexity speech semantic communication method. Specifically, we employ an end-to-end communication approach to transmit the source speech and obtain the reconstructed speech at the receiver. To ensure the accurate extraction of semantic information, we present a low-complexity fully convolutional semantic encoder, which increases the accuracy of semantic information extraction and improves transmission efficiency. Considering the sensitivity of human perception, a multiresolution joint loss function has been implemented to enhance the model's performance and guarantee that the reconstructed speech aligns with the human ear's auditory perception. Experimental results show that the proposed method performs better on objective and subjective metrics than existing speech transmission methods. Compared with existing neural semantic transmission methods, we improve the transmission efficiency, and the number of symbols needed for transmission is decreased by 60% without compromising the quality of speech. Furthermore, the proposed semantic communication method has a lower complexity and consumes less time to transmit.
AB - Deep learning-based semantic communication is an emerging communication method that achieves cooperative transmission between source and channel. The primary objectives of semantic communication are to enhance the efficiency of information transmission and ensure the accurate restoration of semantic content. Recent studies have shown that semantic communication performs well in enhancing transmission rates, especially in low-signal-to-noise ratio environments. However, existing speech semantic communication methods neglect to account for speech perception at the receiver and the complexity of the method, which limits the practical implementation of semantic communication methods. In this article, we propose a perceptually motivated, low-complexity speech semantic communication method. Specifically, we employ an end-to-end communication approach to transmit the source speech and obtain the reconstructed speech at the receiver. To ensure the accurate extraction of semantic information, we present a low-complexity fully convolutional semantic encoder, which increases the accuracy of semantic information extraction and improves transmission efficiency. Considering the sensitivity of human perception, a multiresolution joint loss function has been implemented to enhance the model's performance and guarantee that the reconstructed speech aligns with the human ear's auditory perception. Experimental results show that the proposed method performs better on objective and subjective metrics than existing speech transmission methods. Compared with existing neural semantic transmission methods, we improve the transmission efficiency, and the number of symbols needed for transmission is decreased by 60% without compromising the quality of speech. Furthermore, the proposed semantic communication method has a lower complexity and consumes less time to transmit.
KW - Low complexity
KW - multiresolution joint loss
KW - semantic communication
KW - speech transmission
UR - http://www.scopus.com/inward/record.url?scp=85189155360&partnerID=8YFLogxK
U2 - 10.1109/JIOT.2024.3378779
DO - 10.1109/JIOT.2024.3378779
M3 - Article
AN - SCOPUS:85189155360
SN - 2327-4662
VL - 11
SP - 22054
EP - 22065
JO - IEEE Internet of Things Journal
JF - IEEE Internet of Things Journal
IS - 12
ER -