Three-dimensional trajectory design for multi-user MISO UAV communications: A deep reinforcement learning approach

Yang Wang; Zhen Gao

doi:10.1109/ICCC52777.2021.9580401

Advanced Research Institute of Multidisciplinary Science

Beijing Institute of Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

4 Citations (Scopus)

Abstract

In this paper, we investigate a multi-user downlink multiple-input single-output (MISO) unmanned aerial vehicle (UAV) communication system, where a multi-antenna UAV is employed to serve multiple ground terminals. Unlike existing approaches that focus only on a simplified two-dimensional scenario, this paper considers a three-dimensional (3D) urban environment, where the UAV's 3D trajectory is designed to minimize data transmission completion time subject to practical throughput and flight movement constraints. Specifically, we propose a deep reinforcement learning (DRL)-based trajectory design for completion time minimization (DRL-TDCTM), which is developed from a deep deterministic policy gradient algorithm. In particular, to represent the state information of the UAV and the environment, we introduce an additional piece of information, the merged pheromone, as a reference for the reward, which facilitates the algorithm design. By interacting with the external environment in the corresponding Markov decision process, the proposed algorithm can continuously and adaptively learn how to adjust the UAV's movement strategy. Finally, simulation results show the superiority of the proposed DRL-TDCTM algorithm over the conventional baseline methods.

Original language	English
Title of host publication	2021 IEEE/CIC International Conference on Communications in China, ICCC 2021
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	706-711
Number of pages	6
ISBN (Electronic)	9781665443852
DOIs	https://doi.org/10.1109/ICCC52777.2021.9580401
Publication status	Published - 28 Jul 2021
Event	2021 IEEE/CIC International Conference on Communications in China, ICCC 2021 - Xiamen, China; Duration: 28 Jul 2021 → 30 Jul 2021

Publication series

Name	2021 IEEE/CIC International Conference on Communications in China, ICCC 2021

Conference

Conference	2021 IEEE/CIC International Conference on Communications in China, ICCC 2021
Country/Territory	China
City	Xiamen
Period	28/07/21 → 30/07/21

Keywords

3D trajectory design
Deep reinforcement learning
Multi-antenna UAV
UAV communication systems

Access to Document

10.1109/ICCC52777.2021.9580401

Cite this

Wang, Y., & Gao, Z. (2021). Three-dimensional trajectory design for multi-user MISO UAV communications: A deep reinforcement learning approach. In 2021 IEEE/CIC International Conference on Communications in China, ICCC 2021 (pp. 706-711). (2021 IEEE/CIC International Conference on Communications in China, ICCC 2021). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICCC52777.2021.9580401

Wang, Yang ; Gao, Zhen. / Three-dimensional trajectory design for multi-user MISO UAV communications : A deep reinforcement learning approach. 2021 IEEE/CIC International Conference on Communications in China, ICCC 2021. Institute of Electrical and Electronics Engineers Inc., 2021. pp. 706-711 (2021 IEEE/CIC International Conference on Communications in China, ICCC 2021).

@inproceedings{fac1e2eea3f443b785ae4cfe3a3efd79,

title = "Three-dimensional trajectory design for multi-user MISO UAV communications: A deep reinforcement learning approach",

abstract = "In this paper, we investigate a multi-user downlink multiple-input single-output (MISO) unmanned aerial vehicle (UAV) communication system, where a multi-antenna UAV is employed to serve multiple ground terminals. Unlike existing approaches that focus only on a simplified two-dimensional scenario, this paper considers a three-dimensional (3D) urban environment, where the UAV's 3D trajectory is designed to minimize data transmission completion time subject to practical throughput and flight movement constraints. Specifically, we propose a deep reinforcement learning (DRL)-based trajectory design for completion time minimization (DRL-TDCTM), which is developed from a deep deterministic policy gradient algorithm. In particular, to represent the state information of the UAV and the environment, we introduce an additional piece of information, the merged pheromone, as a reference for the reward, which facilitates the algorithm design. By interacting with the external environment in the corresponding Markov decision process, the proposed algorithm can continuously and adaptively learn how to adjust the UAV's movement strategy. Finally, simulation results show the superiority of the proposed DRL-TDCTM algorithm over the conventional baseline methods.",

keywords = "3D trajectory design, Deep reinforcement learning, Multi-antenna UAV, UAV communication systems",

author = "Yang Wang and Zhen Gao",

note = "Publisher Copyright: {\textcopyright} 2021 IEEE.; 2021 IEEE/CIC International Conference on Communications in China, ICCC 2021 ; Conference date: 28-07-2021 Through 30-07-2021",

year = "2021",

month = jul,

day = "28",

doi = "10.1109/ICCC52777.2021.9580401",

language = "English",

series = "2021 IEEE/CIC International Conference on Communications in China, ICCC 2021",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "706--711",

booktitle = "2021 IEEE/CIC International Conference on Communications in China, ICCC 2021",

address = "United States",

}

Wang, Y & Gao, Z 2021, Three-dimensional trajectory design for multi-user MISO UAV communications: A deep reinforcement learning approach. in 2021 IEEE/CIC International Conference on Communications in China, ICCC 2021. 2021 IEEE/CIC International Conference on Communications in China, ICCC 2021, Institute of Electrical and Electronics Engineers Inc., pp. 706-711, 2021 IEEE/CIC International Conference on Communications in China, ICCC 2021, Xiamen, China, 28/07/21. https://doi.org/10.1109/ICCC52777.2021.9580401

Three-dimensional trajectory design for multi-user MISO UAV communications: A deep reinforcement learning approach. / Wang, Yang; Gao, Zhen.
2021 IEEE/CIC International Conference on Communications in China, ICCC 2021. Institute of Electrical and Electronics Engineers Inc., 2021. p. 706-711 (2021 IEEE/CIC International Conference on Communications in China, ICCC 2021).

TY - GEN

T1 - Three-dimensional trajectory design for multi-user MISO UAV communications

T2 - 2021 IEEE/CIC International Conference on Communications in China, ICCC 2021

AU - Wang, Yang

AU - Gao, Zhen

PY - 2021/7/28

Y1 - 2021/7/28

AB - In this paper, we investigate a multi-user downlink multiple-input single-output (MISO) unmanned aerial vehicle (UAV) communication system, where a multi-antenna UAV is employed to serve multiple ground terminals. Unlike existing approaches that focus only on a simplified two-dimensional scenario, this paper considers a three-dimensional (3D) urban environment, where the UAV's 3D trajectory is designed to minimize data transmission completion time subject to practical throughput and flight movement constraints. Specifically, we propose a deep reinforcement learning (DRL)-based trajectory design for completion time minimization (DRL-TDCTM), which is developed from a deep deterministic policy gradient algorithm. In particular, to represent the state information of the UAV and the environment, we introduce an additional piece of information, the merged pheromone, as a reference for the reward, which facilitates the algorithm design. By interacting with the external environment in the corresponding Markov decision process, the proposed algorithm can continuously and adaptively learn how to adjust the UAV's movement strategy. Finally, simulation results show the superiority of the proposed DRL-TDCTM algorithm over the conventional baseline methods.

KW - 3D trajectory design

KW - Deep reinforcement learning

KW - Multi-antenna UAV

KW - UAV communication systems

UR - http://www.scopus.com/inward/record.url?scp=85119342524&partnerID=8YFLogxK

U2 - 10.1109/ICCC52777.2021.9580401

DO - 10.1109/ICCC52777.2021.9580401

M3 - Conference contribution

AN - SCOPUS:85119342524

T3 - 2021 IEEE/CIC International Conference on Communications in China, ICCC 2021

SP - 706

EP - 711

BT - 2021 IEEE/CIC International Conference on Communications in China, ICCC 2021

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 28 July 2021 through 30 July 2021

ER -

Wang Y, Gao Z. Three-dimensional trajectory design for multi-user MISO UAV communications: A deep reinforcement learning approach. In 2021 IEEE/CIC International Conference on Communications in China, ICCC 2021. Institute of Electrical and Electronics Engineers Inc. 2021. p. 706-711. (2021 IEEE/CIC International Conference on Communications in China, ICCC 2021). doi: 10.1109/ICCC52777.2021.9580401
