Skip to main navigation Skip to search Skip to main content

MADDPG Based Path Planning for Multi-UAV Information Collection in IoT Networks

  • Liangbin Zhu
  • , Kai Yang
  • , Jinglei Li*
  • , Yuxuan Yang
  • , Qinghai Yang
  • , Xiaozheng Gao
  • *Corresponding author for this work
  • Beijing Institute of Technology
  • China Electronics Technology Group Corporation
  • Xidian University

Research output: Contribution to journalArticlepeer-review

Abstract

Multi-UAV-assisted information collection is an effective solution for large-scale Internet of Things (IoT) networks, but it is challenged by limited onboard energy, charging requirements, and the need for coordinated path planning among multiple UAVs. This paper investigates a multi-UAV information collection problem in which UAVs cooperatively collect data from distributed IoT devices under energy constraints, charging operations, and collision avoidance constraints, with the objective of minimizing the total mission completion time. The considered problem is formulated as a multi-agent Markov decision process. To address the coupled and high-dimensional decision-making nature of the system, a multi-agent deep deterministic policy gradient (MADDPG) framework with centralized training and decentralized execution is developed for cooperative UAV path planning. An energy-aware reward function is designed to jointly account for information collection efficiency, energy consumption, and charging behavior. Simulation results demonstrate that the proposed approach achieves shorter mission completion time and better scalability compared with DDPG-based and MAPPO-based methods.

Original languageEnglish
Pages (from-to)9300-9319
Number of pages20
JournalIEEE Transactions on Network Science and Engineering
Volume13
DOIs
Publication statusPublished - 2026
Externally publishedYes

Keywords

  • MADDPG
  • UAV
  • aerial base station
  • wireless communication system

Fingerprint

Dive into the research topics of 'MADDPG Based Path Planning for Multi-UAV Information Collection in IoT Networks'. Together they form a unique fingerprint.

Cite this