Three-dimensional trajectory design for multi-user MISO UAV communications: A deep reinforcement learning approach

Yang Wang, Zhen Gao

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)

Abstract

In this paper, we investigate a multi-user downlink multiple-input single-output (MISO) unmanned aerial vehicle (UAV) communication system, where a multi-antenna UAV is employed to serve multiple ground terminals. Unlike existing approaches focus only on a simplified two-dimensional scenario, this paper considers a three-dimensional (3D) urban environment, where the UAV's 3D trajectory is designed to minimize data transmission completion time subject to practical throughput and flight movement constraints. Specifically, we propose a deep reinforcement learning (DRL)-based trajectory design for completion time minimization (DRL- TDCTM), which is developed from a deep deterministic policy gradient algorithm. In particular, to represent the state information of UAV and environment, we set an additional information, i.e., the merged pheromone, as a reference of reward which facilitates the algorithm design. By interacting with the external environment in the corresponding Markov decision process, the proposed algorithm can continuously and adaptively learn how to adjust the UAV's movement strategy. Finally, simulation results show the superiority of the proposed DRL- TDCTM algorithm over the conventional baseline methods.

Original languageEnglish
Title of host publication2021 IEEE/CIC International Conference on Communications in China, ICCC 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages706-711
Number of pages6
ISBN (Electronic)9781665443852
DOIs
Publication statusPublished - 28 Jul 2021
Event2021 IEEE/CIC International Conference on Communications in China, ICCC 2021 - Xiamen, China
Duration: 28 Jul 202130 Jul 2021

Publication series

Name2021 IEEE/CIC International Conference on Communications in China, ICCC 2021

Conference

Conference2021 IEEE/CIC International Conference on Communications in China, ICCC 2021
Country/TerritoryChina
CityXiamen
Period28/07/2130/07/21

Keywords

  • 3D trajectory design
  • Deep reinforcement learning
  • Multi-antenna UAV
  • UAV communication systems

Fingerprint

Dive into the research topics of 'Three-dimensional trajectory design for multi-user MISO UAV communications: A deep reinforcement learning approach'. Together they form a unique fingerprint.

Cite this