跳到主要导航 跳到搜索 跳到主要内容

Token Communications: A Large Model-Driven Framework for Cross-Modal Context-Aware Semantic Communications

  • Li Qiao
  • , Mahdi Boloursaz Mashhadi
  • , Zhen Gao*
  • , Rahim Tafazolli*
  • , Mehdi Bennis
  • , Dusit Niyato
  • *此作品的通讯作者
  • Beijing Institute of Technology
  • University of Surrey
  • University of Oulu
  • Nanyang Technological University

科研成果: 期刊稿件文章同行评审

摘要

In this article, we introduce token communications (TokCom), a large model-driven framework to leverage cross-modal context information in generative semantic communications (GenSC). TokCom is a new paradigm, motivated by the recent success of generative foundation models and multimodal large language models (GFM/MLLMs), where the communication units are tokens, enabling efficient transformer-based token processing at the transmitter and receiver. In this article, we introduce the potential opportunities and challenges of leveraging context in GenSC, explore how to integrate GFM/MLLMs-based token processing into semantic communication systems to leverage cross-modal context effectively at affordable complexity, present the key principles for efficient TokCom at various layers in future wireless networks. In a typical image semantic communication setup, we demonstrate a significant improvement of the bandwidth efficiency, achieved by TokCom by leveraging the context information among tokens. Finally, the potential research directions are identified to facilitate adoption of TokCom in future wireless networks.

源语言英语
页(从-至)80-88
页数9
期刊IEEE Wireless Communications
32
5
DOI
出版状态已出版 - 2025

指纹

探究 'Token Communications: A Large Model-Driven Framework for Cross-Modal Context-Aware Semantic Communications' 的科研主题。它们共同构成独一无二的指纹。

引用此