Three-dimensional trajectory design for multi-user MISO UAV communications: A deep reinforcement learning approach

Yang Wang, Zhen Gao

科研成果: 书/报告/会议事项章节会议稿件同行评审

4 引用 (Scopus)

摘要

In this paper, we investigate a multi-user downlink multiple-input single-output (MISO) unmanned aerial vehicle (UAV) communication system, where a multi-antenna UAV is employed to serve multiple ground terminals. Unlike existing approaches focus only on a simplified two-dimensional scenario, this paper considers a three-dimensional (3D) urban environment, where the UAV's 3D trajectory is designed to minimize data transmission completion time subject to practical throughput and flight movement constraints. Specifically, we propose a deep reinforcement learning (DRL)-based trajectory design for completion time minimization (DRL- TDCTM), which is developed from a deep deterministic policy gradient algorithm. In particular, to represent the state information of UAV and environment, we set an additional information, i.e., the merged pheromone, as a reference of reward which facilitates the algorithm design. By interacting with the external environment in the corresponding Markov decision process, the proposed algorithm can continuously and adaptively learn how to adjust the UAV's movement strategy. Finally, simulation results show the superiority of the proposed DRL- TDCTM algorithm over the conventional baseline methods.

源语言英语
主期刊名2021 IEEE/CIC International Conference on Communications in China, ICCC 2021
出版商Institute of Electrical and Electronics Engineers Inc.
706-711
页数6
ISBN(电子版)9781665443852
DOI
出版状态已出版 - 28 7月 2021
活动2021 IEEE/CIC International Conference on Communications in China, ICCC 2021 - Xiamen, 中国
期限: 28 7月 202130 7月 2021

出版系列

姓名2021 IEEE/CIC International Conference on Communications in China, ICCC 2021

会议

会议2021 IEEE/CIC International Conference on Communications in China, ICCC 2021
国家/地区中国
Xiamen
时期28/07/2130/07/21

指纹

探究 'Three-dimensional trajectory design for multi-user MISO UAV communications: A deep reinforcement learning approach' 的科研主题。它们共同构成独一无二的指纹。

引用此