Deep Reinforcement Learning Based Trajectory Planning for Multi-UAV Cooperative Data Collection

Yuqi Miao, Lei Lei, Lijuan Zhang*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In the context of UAV trajectory planning for data collection, challenges such as the uncertainty of a large-scale dynamic unknown environment and the need for multi-UAV coordination are prevalent. To address these challenges, this paper proposes a UAV data collection trajectory planning algorithm based on the D3QN (Double Dueling Deep Q-Network) algorithm. The proposed algorithm enables multiple UAVs to dynamically plan their flight paths for data collection in unknown environments through centralized training and distributed application. The algorithm’s performance is improved by incorporating competition mechanisms, candidate node queues, and reward function reshaping techniques. Based on the simulation results, the proposed algorithm outperforms similar algorithms in terms of success rates and task durations.

Original languageEnglish
Title of host publicationProceedings of the 2nd International Conference on Internet of Things, Communication and Intelligent Technology
EditorsJian Dong, Long Zhang, Deqiang Cheng
PublisherSpringer Science and Business Media Deutschland GmbH
Pages101-108
Number of pages8
ISBN (Print)9789819727568
DOIs
Publication statusPublished - 2024
Externally publishedYes
Event2nd International Conference on Internet of Things, Communication and Intelligent Technology, IoTCIT 2023 - Xuzhou, China
Duration: 22 Sept 202324 Sept 2023

Publication series

NameLecture Notes in Electrical Engineering
Volume1197
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

Conference2nd International Conference on Internet of Things, Communication and Intelligent Technology, IoTCIT 2023
Country/TerritoryChina
CityXuzhou
Period22/09/2324/09/23

Keywords

  • D3QN
  • data collection
  • deep reinforcement learning
  • trajectory planning
  • UAV

Fingerprint

Dive into the research topics of 'Deep Reinforcement Learning Based Trajectory Planning for Multi-UAV Cooperative Data Collection'. Together they form a unique fingerprint.

Cite this