TY - JOUR
T1 - UAV Carrier Enabled Vehicular Crowdsensing by Multi-Agent Reinforcement Learning with Mutual Policy Divergence and Attentive Memory Update
AU - Zhao, Qiran
AU - Liu, Chi Harold
AU - Zhao, Jianxin
AU - Li, Guozheng
AU - Qi, Guangpeng
AU - Ji, Xu
AU - Xu, Duo
AU - Crowcroft, Jon
N1 - Publisher Copyright:
© 2002-2012 IEEE.
PY - 2026
Y1 - 2026
N2 - Vehicular Crowdsensing (VCS) has emerged as a promising paradigm that leverages the complementary strengths of unmanned aerial vehicles (UAVs) and unmanned ground vehicles (UGVs) for large-scale urban sensing and data collection. In this paper, we consider a UAV-carrier-enabled VCS campaign in which UGVs dynamically dispatch and recall UAVs within the workzone, where UAVs sense points of interest (PoIs) and UGVs facilitate data collection, with the goal of maximizing the total collected data volume and geographic fairness, while minimizing overall energy consumption. We propose a heterogeneous multi-agent deep reinforcement learning (MADRL) framework, called “HADRL-VCS”, consisting of an attentive memory-integrated information exchange mechanism that enables UAVs and UGVs to fuse newly received information with historical memory, thereby expanding the collective sensing range and enhancing cooperative decision-making. We also propose a mutual policy divergence-driven exploration strategy designed to explicitly promote diverse exploration and complementary role differentiation among heterogeneous UAVs and UGVs. Extensive experimental results based on realistic simulations using real-world urban maps from Guangzhou, China, and Madrid, Spain, show that HADRL-VCS achieves better performance over five baselines in terms of data collection ratio, geographic fairness, sensing range expansion ratio, overlap ratio, and efficiency.
AB - Vehicular Crowdsensing (VCS) has emerged as a promising paradigm that leverages the complementary strengths of unmanned aerial vehicles (UAVs) and unmanned ground vehicles (UGVs) for large-scale urban sensing and data collection. In this paper, we consider a UAV-carrier-enabled VCS campaign in which UGVs dynamically dispatch and recall UAVs within the workzone, where UAVs sense points of interest (PoIs) and UGVs facilitate data collection, with the goal of maximizing the total collected data volume and geographic fairness, while minimizing overall energy consumption. We propose a heterogeneous multi-agent deep reinforcement learning (MADRL) framework, called “HADRL-VCS”, consisting of an attentive memory-integrated information exchange mechanism that enables UAVs and UGVs to fuse newly received information with historical memory, thereby expanding the collective sensing range and enhancing cooperative decision-making. We also propose a mutual policy divergence-driven exploration strategy designed to explicitly promote diverse exploration and complementary role differentiation among heterogeneous UAVs and UGVs. Extensive experimental results based on realistic simulations using real-world urban maps from Guangzhou, China, and Madrid, Spain, show that HADRL-VCS achieves better performance over five baselines in terms of data collection ratio, geographic fairness, sensing range expansion ratio, overlap ratio, and efficiency.
KW - Multi-agent deep reinforcement learning
KW - UAV carrier
KW - Vehicular crowdsensing
UR - https://www.scopus.com/pages/publications/105039326273
U2 - 10.1109/TMC.2026.3693470
DO - 10.1109/TMC.2026.3693470
M3 - Article
AN - SCOPUS:105039326273
SN - 1536-1233
JO - IEEE Transactions on Mobile Computing
JF - IEEE Transactions on Mobile Computing
ER -