TY - JOUR
T1 - Token Communications
T2 - A Large Model-Driven Framework for Cross-Modal Context-Aware Semantic Communications
AU - Qiao, Li
AU - Mashhadi, Mahdi Boloursaz
AU - Gao, Zhen
AU - Tafazolli, Rahim
AU - Bennis, Mehdi
AU - Niyato, Dusit
N1 - Publisher Copyright:
© 2002-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - In this article, we introduce token communications (TokCom), a large model-driven framework to leverage cross-modal context information in generative semantic communications (GenSC). TokCom is a new paradigm, motivated by the recent success of generative foundation models and multimodal large language models (GFM/MLLMs), where the communication units are tokens, enabling efficient transformer-based token processing at the transmitter and receiver. In this article, we introduce the potential opportunities and challenges of leveraging context in GenSC, explore how to integrate GFM/MLLMs-based token processing into semantic communication systems to leverage cross-modal context effectively at affordable complexity, present the key principles for efficient TokCom at various layers in future wireless networks. In a typical image semantic communication setup, we demonstrate a significant improvement of the bandwidth efficiency, achieved by TokCom by leveraging the context information among tokens. Finally, the potential research directions are identified to facilitate adoption of TokCom in future wireless networks.
AB - In this article, we introduce token communications (TokCom), a large model-driven framework to leverage cross-modal context information in generative semantic communications (GenSC). TokCom is a new paradigm, motivated by the recent success of generative foundation models and multimodal large language models (GFM/MLLMs), where the communication units are tokens, enabling efficient transformer-based token processing at the transmitter and receiver. In this article, we introduce the potential opportunities and challenges of leveraging context in GenSC, explore how to integrate GFM/MLLMs-based token processing into semantic communication systems to leverage cross-modal context effectively at affordable complexity, present the key principles for efficient TokCom at various layers in future wireless networks. In a typical image semantic communication setup, we demonstrate a significant improvement of the bandwidth efficiency, achieved by TokCom by leveraging the context information among tokens. Finally, the potential research directions are identified to facilitate adoption of TokCom in future wireless networks.
UR - https://www.scopus.com/pages/publications/105017373897
U2 - 10.1109/MWC.001.2500084
DO - 10.1109/MWC.001.2500084
M3 - Article
AN - SCOPUS:105017373897
SN - 1536-1284
VL - 32
SP - 80
EP - 88
JO - IEEE Wireless Communications
JF - IEEE Wireless Communications
IS - 5
ER -