TY - GEN
T1 - Joint Split Inference Design and Bandwidth Allocation for Intelligent Edge Perception Network
AU - Liu, Peng
AU - Tang, Shuntian
AU - Lin, Ke
AU - Ha, Nan
AU - Wang, Xinyi
AU - Fei, Zesong
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - The integration of artificial intelligence and wireless sensing at the network edge offers significant potential to alleviate device-side computational burdens and reduce the computational delay of the sensing task. However, offloading raw sensing data to Mobile Edge Computing (MEC) servers often leads to substantial transmission delays. To address this, we introduce split inference into edge perception networks and propose a three-tier collaborative inference architecture involving cloud servers, MEC servers, and devices. Each tier executes a portion of a Deep Neural Network (DNN)-based sensing task. Leveraging Integrated Sensing and Communication (ISAC) technology, devices simultaneously sense the environment and offload intermediate DNN features to the MEC or cloud server for further inference. We formulate a joint optimization problem to determine the adaptive DNN splitting strategy and bandwidth allocation, aiming to minimize total inference delay. To solve the resulting mixed-integer nonlinear programming problem, we develop a two-layer optimization algorithm based on the Karush-Kuhn-Tucker conditions and the cross-entropy method. Simulation results show that the proposed framework significantly reduces average inference delay compared to existing two-tier collaborative inference schemes.
AB - The integration of artificial intelligence and wireless sensing at the network edge offers significant potential to alleviate device-side computational burdens and reduce the computational delay of the sensing task. However, offloading raw sensing data to Mobile Edge Computing (MEC) servers often leads to substantial transmission delays. To address this, we introduce split inference into edge perception networks and propose a three-tier collaborative inference architecture involving cloud servers, MEC servers, and devices. Each tier executes a portion of a Deep Neural Network (DNN)-based sensing task. Leveraging Integrated Sensing and Communication (ISAC) technology, devices simultaneously sense the environment and offload intermediate DNN features to the MEC or cloud server for further inference. We formulate a joint optimization problem to determine the adaptive DNN splitting strategy and bandwidth allocation, aiming to minimize total inference delay. To solve the resulting mixed-integer nonlinear programming problem, we develop a two-layer optimization algorithm based on the Karush-Kuhn-Tucker conditions and the cross-entropy method. Simulation results show that the proposed framework significantly reduces average inference delay compared to existing two-tier collaborative inference schemes.
KW - Edge perception network
KW - bandwidth allocation
KW - cross-entropy
KW - split inference
UR - https://www.scopus.com/pages/publications/105017579721
U2 - 10.1109/ICCCWorkshops67136.2025.11148170
DO - 10.1109/ICCCWorkshops67136.2025.11148170
M3 - Conference contribution
AN - SCOPUS:105017579721
T3 - 2025 IEEE/CIC International Conference on Communications in China, ICCC Workshops 2025
BT - 2025 IEEE/CIC International Conference on Communications in China, ICCC Workshops 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2025 IEEE/CIC International Conference on Communications in China, ICCC Workshops 2025
Y2 - 10 August 2025 through 13 August 2025
ER -