TY - JOUR
T1 - Generative Artificial intelligence-Enhanced MultiModal Semantic Communication in Internet of Vehicles
T2 - System Design and Methodologies
AU - Lu, Jiayi
AU - Yang, Wanting
AU - Xiong, Zehui
AU - Xing, Chengwen
AU - Tafazolli, Rahim
AU - Quek, Tony Q.S.
AU - Debbah, Merouane
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Vehicle-to-everything (V2X) communication supports numerous tasks, from driving safety to entertainment services. To achieve a holistic view, vehicles are typically equipped with multiple sensors. However, processing large volumes of multimodal data increases transmission load, while the dynamic nature of vehicular networks adds to transmission instability. To address these challenges, we propose a novel framework, generative artificial intelligence (GAI)-enhanced multimodal semantic communication (SemCom), referred to as G-MSC, designed to handle various vehicular network tasks by employing suitable analog or digital transmission. GAI presents a promising opportunity to transform the SemCom framework by significantly enhancing semantic encoding, semantic information transmission, and semantic decoding. It optimizes multimodal information fusion at the transmitter, enhances channel robustness during transmission, and mitigates noise interference at the receiver. To validate the effectiveness of the G-MSC framework, we conduct a case study showcasing its performance in vehicular communication networks for predictive tasks. The experimental results show that the design achieves reliable and efficient communication in V2X networks. In the end, we present future research directions of G-MSC.
AB - Vehicle-to-everything (V2X) communication supports numerous tasks, from driving safety to entertainment services. To achieve a holistic view, vehicles are typically equipped with multiple sensors. However, processing large volumes of multimodal data increases transmission load, while the dynamic nature of vehicular networks adds to transmission instability. To address these challenges, we propose a novel framework, generative artificial intelligence (GAI)-enhanced multimodal semantic communication (SemCom), referred to as G-MSC, designed to handle various vehicular network tasks by employing suitable analog or digital transmission. GAI presents a promising opportunity to transform the SemCom framework by significantly enhancing semantic encoding, semantic information transmission, and semantic decoding. It optimizes multimodal information fusion at the transmitter, enhances channel robustness during transmission, and mitigates noise interference at the receiver. To validate the effectiveness of the G-MSC framework, we conduct a case study showcasing its performance in vehicular communication networks for predictive tasks. The experimental results show that the design achieves reliable and efficient communication in V2X networks. In the end, we present future research directions of G-MSC.
UR - http://www.scopus.com/inward/record.url?scp=105000791308&partnerID=8YFLogxK
U2 - 10.1109/MVT.2025.3545399
DO - 10.1109/MVT.2025.3545399
M3 - Article
AN - SCOPUS:105000791308
SN - 1556-6072
VL - 20
SP - 71
EP - 82
JO - IEEE Vehicular Technology Magazine
JF - IEEE Vehicular Technology Magazine
IS - 2
ER -