TY - JOUR
T1 - MADDPG Based Path Planning for Multi-UAV Information Collection in IoT Networks
AU - Zhu, Liangbin
AU - Yang, Kai
AU - Li, Jinglei
AU - Yang, Yuxuan
AU - Yang, Qinghai
AU - Gao, Xiaozheng
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2026
Y1 - 2026
N2 - Multi-UAV-assisted information collection is an effective solution for large-scale Internet of Things (IoT) networks, but it is challenged by limited onboard energy, charging requirements, and the need for coordinated path planning among multiple UAVs. This paper investigates a multi-UAV information collection problem in which UAVs cooperatively collect data from distributed IoT devices under energy constraints, charging operations, and collision avoidance constraints, with the objective of minimizing the total mission completion time. The considered problem is formulated as a multi-agent Markov decision process. To address the coupled and high-dimensional decision-making nature of the system, a multi-agent deep deterministic policy gradient (MADDPG) framework with centralized training and decentralized execution is developed for cooperative UAV path planning. An energy-aware reward function is designed to jointly account for information collection efficiency, energy consumption, and charging behavior. Simulation results demonstrate that the proposed approach achieves shorter mission completion time and better scalability compared with DDPG-based and MAPPO-based methods.
AB - Multi-UAV-assisted information collection is an effective solution for large-scale Internet of Things (IoT) networks, but it is challenged by limited onboard energy, charging requirements, and the need for coordinated path planning among multiple UAVs. This paper investigates a multi-UAV information collection problem in which UAVs cooperatively collect data from distributed IoT devices under energy constraints, charging operations, and collision avoidance constraints, with the objective of minimizing the total mission completion time. The considered problem is formulated as a multi-agent Markov decision process. To address the coupled and high-dimensional decision-making nature of the system, a multi-agent deep deterministic policy gradient (MADDPG) framework with centralized training and decentralized execution is developed for cooperative UAV path planning. An energy-aware reward function is designed to jointly account for information collection efficiency, energy consumption, and charging behavior. Simulation results demonstrate that the proposed approach achieves shorter mission completion time and better scalability compared with DDPG-based and MAPPO-based methods.
KW - MADDPG
KW - UAV
KW - aerial base station
KW - wireless communication system
UR - https://www.scopus.com/pages/publications/105038673928
U2 - 10.1109/TNSE.2026.3690003
DO - 10.1109/TNSE.2026.3690003
M3 - Article
AN - SCOPUS:105038673928
SN - 2327-4697
JO - IEEE Transactions on Network Science and Engineering
JF - IEEE Transactions on Network Science and Engineering
ER -