Agent-driven Generative Semantic Communication with Cross-Modality and Prediction

Wanting Yang; Zehui Xiong; Yanli Yuan; Wenchao Jiang; Tony Q.S. Quek; Merouane Debbah

doi:10.1109/TWC.2024.3519325

Agent-driven Generative Semantic Communication with Cross-Modality and Prediction

Wanting Yang, Zehui Xiong^*, Yanli Yuan^*, Wenchao Jiang, Tony Q.S. Quek, Merouane Debbah

^*此作品的通讯作者

网络空间安全学院

科研成果: 期刊稿件 › 文章 › 同行评审

摘要

In the era of 6G, with compelling visions of intelligent transportation systems and digital twins, remote surveillance is poised to become a ubiquitous practice. Substantial data volume and frequent updates present challenges in wireless networks. To address these challenges, we propose a novel agent-driven generative semantic communication (A-GSC) framework based on reinforcement learning. In contrast to the existing research on semantic communication (SemCom), which mainly focuses on either semantic extraction or semantic sampling, we seamlessly integrate both by jointly considering the intrinsic attributes of source information and the contextual information regarding the task. Notably, the introduction of generative artificial intelligence (GAI) enables the independent design of semantic encoders and decoders. In this work, we develop an agent-assisted semantic encoder with cross-modality capability, which can track the semantic changes, channel condition, to perform adaptive semantic extraction and sampling. Accordingly, we design a semantic decoder with both predictive and generative capabilities, consisting of two tailored modules. Moreover, the effectiveness of the designed models has been verified using the UA-DETRAC dataset, demonstrating the performance gains of the overall A-GSC framework in both energy saving and reconstruction accuracy.

源语言	英语
期刊	IEEE Transactions on Wireless Communications
DOI	https://doi.org/10.1109/TWC.2024.3519325
出版状态	已接受/待刊 - 2024

访问文件

10.1109/TWC.2024.3519325

其它文件与链接

链接到 Scopus 的出版物

引用此

Yang, W., Xiong, Z., Yuan, Y., Jiang, W., Quek, T. Q. S., & Debbah, M. (已接受/印刷中). Agent-driven Generative Semantic Communication with Cross-Modality and Prediction. IEEE Transactions on Wireless Communications. https://doi.org/10.1109/TWC.2024.3519325

@article{cf692cd43796424ca8c81659e47a6ca9,

title = "Agent-driven Generative Semantic Communication with Cross-Modality and Prediction",

abstract = "In the era of 6G, with compelling visions of intelligent transportation systems and digital twins, remote surveillance is poised to become a ubiquitous practice. Substantial data volume and frequent updates present challenges in wireless networks. To address these challenges, we propose a novel agent-driven generative semantic communication (A-GSC) framework based on reinforcement learning. In contrast to the existing research on semantic communication (SemCom), which mainly focuses on either semantic extraction or semantic sampling, we seamlessly integrate both by jointly considering the intrinsic attributes of source information and the contextual information regarding the task. Notably, the introduction of generative artificial intelligence (GAI) enables the independent design of semantic encoders and decoders. In this work, we develop an agent-assisted semantic encoder with cross-modality capability, which can track the semantic changes, channel condition, to perform adaptive semantic extraction and sampling. Accordingly, we design a semantic decoder with both predictive and generative capabilities, consisting of two tailored modules. Moreover, the effectiveness of the designed models has been verified using the UA-DETRAC dataset, demonstrating the performance gains of the overall A-GSC framework in both energy saving and reconstruction accuracy.",

keywords = "Semantic communication, deep reinforcement learning, diffusion model, semantic sampling, video streaming",

author = "Wanting Yang and Zehui Xiong and Yanli Yuan and Wenchao Jiang and Quek, {Tony Q.S.} and Merouane Debbah",

note = "Publisher Copyright: {\textcopyright} 2002-2012 IEEE.",

year = "2024",

doi = "10.1109/TWC.2024.3519325",

language = "English",

journal = "IEEE Transactions on Wireless Communications",

issn = "1536-1276",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Agent-driven Generative Semantic Communication with Cross-Modality and Prediction

AU - Yang, Wanting

AU - Xiong, Zehui

AU - Yuan, Yanli

AU - Jiang, Wenchao

AU - Quek, Tony Q.S.

AU - Debbah, Merouane

PY - 2024

Y1 - 2024

N2 - In the era of 6G, with compelling visions of intelligent transportation systems and digital twins, remote surveillance is poised to become a ubiquitous practice. Substantial data volume and frequent updates present challenges in wireless networks. To address these challenges, we propose a novel agent-driven generative semantic communication (A-GSC) framework based on reinforcement learning. In contrast to the existing research on semantic communication (SemCom), which mainly focuses on either semantic extraction or semantic sampling, we seamlessly integrate both by jointly considering the intrinsic attributes of source information and the contextual information regarding the task. Notably, the introduction of generative artificial intelligence (GAI) enables the independent design of semantic encoders and decoders. In this work, we develop an agent-assisted semantic encoder with cross-modality capability, which can track the semantic changes, channel condition, to perform adaptive semantic extraction and sampling. Accordingly, we design a semantic decoder with both predictive and generative capabilities, consisting of two tailored modules. Moreover, the effectiveness of the designed models has been verified using the UA-DETRAC dataset, demonstrating the performance gains of the overall A-GSC framework in both energy saving and reconstruction accuracy.

AB - In the era of 6G, with compelling visions of intelligent transportation systems and digital twins, remote surveillance is poised to become a ubiquitous practice. Substantial data volume and frequent updates present challenges in wireless networks. To address these challenges, we propose a novel agent-driven generative semantic communication (A-GSC) framework based on reinforcement learning. In contrast to the existing research on semantic communication (SemCom), which mainly focuses on either semantic extraction or semantic sampling, we seamlessly integrate both by jointly considering the intrinsic attributes of source information and the contextual information regarding the task. Notably, the introduction of generative artificial intelligence (GAI) enables the independent design of semantic encoders and decoders. In this work, we develop an agent-assisted semantic encoder with cross-modality capability, which can track the semantic changes, channel condition, to perform adaptive semantic extraction and sampling. Accordingly, we design a semantic decoder with both predictive and generative capabilities, consisting of two tailored modules. Moreover, the effectiveness of the designed models has been verified using the UA-DETRAC dataset, demonstrating the performance gains of the overall A-GSC framework in both energy saving and reconstruction accuracy.

KW - Semantic communication

KW - deep reinforcement learning

KW - diffusion model

KW - semantic sampling

KW - video streaming

UR - http://www.scopus.com/inward/record.url?scp=85213549344&partnerID=8YFLogxK

U2 - 10.1109/TWC.2024.3519325

DO - 10.1109/TWC.2024.3519325

M3 - Article

AN - SCOPUS:85213549344

SN - 1536-1276

JO - IEEE Transactions on Wireless Communications

JF - IEEE Transactions on Wireless Communications

ER -

Agent-driven Generative Semantic Communication with Cross-Modality and Prediction

摘要

访问文件

其它文件与链接

指纹

引用此