Abstract
Unmanned aerial vehicle (UAV) crowdsensing (UCS) is an emerging data collection paradigm to provide reliable and high quality urban sensing services, with age-of-information (AoI) requirement to measure data freshness in real-time applications. In this paper, we explicitly consider the case to ensure that the attained AoI always stay within a specific threshold. The goal is to maximize the total amount of collected data from diverse Point-of-Interests (PoIs) while minimizing AoI and AoI threshold violation ratio under limited energy supplement. To this end, we propose a decentralized multi-agent deep reinforcement learning framework called 'DRL-UCS( AoI_th )' for multi-UAV trajectory planning, which consists of a novel transformer-enhanced distributed architecture and an adaptive intrinsic reward mechanism for spatial cooperation and exploration. Extensive results and trajectory visualization on two real-world datasets in Beijing and San Francisco show that, DRL-UCS( AoI_th ) consistently outperforms all nine baselines when varying the number of UAVs, AoI threshold and generated data amount in a timeslot.
Original language | English |
---|---|
Pages (from-to) | 566-581 |
Number of pages | 16 |
Journal | IEEE/ACM Transactions on Networking |
Volume | 32 |
Issue number | 1 |
DOIs | |
Publication status | Published - 1 Feb 2024 |
Keywords
- AoI
- UAV crowdsensing
- multi-agent deep reinforcement learning
- transformer