Deep Reinforcement Learning for Flocking Motion of Multi-UAV Systems: Learn From a Digital Twin

Gaoqing Shen, Lei Lei*, Zhilin Li, Shengsuo Cai, Lijuan Zhang, Pan Cao, Xiaojiao Liu

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

77 Citations (Scopus)

Abstract

Over the past decades, unmanned aerial vehicles (UAVs) have been widely used in both military and civilian fields. In these applications, flocking motion is a fundamental but crucial operation of multi-UAV systems. Traditional flocking motion methods usually designed for a specific environment. However, the real environment is mostly unknown and stochastic, which greatly reduces the practicality of these methods. In this article, deep reinforcement learning (DRL) is used to realize the flocking motion of multi-UAV systems. Considering that the sim-to-real problem restricts the application of DRL to the flocking motion scenario, a digital twin (DT)-enabled DRL training framework is proposed to solve this problem. The DRL model can learn from DT and be quickly deployed on the real-world UAV with the help of DT. Under this training framework, this article proposes an actor-critic DRL algorithm, named behavior-coupling deep deterministic policy gradient (BCDDPG), for the flocking motion problem, which is inspired by the flocking behavior of animals. Extensive simulations are conducted to evaluate the performance of BCDDPG. Simulation results show that BCDDPG achieves a higher average reward and performs better in terms of arrival rate and collision rate compared with the existing methods.

Original languageEnglish
Pages (from-to)11141-11153
Number of pages13
JournalIEEE Internet of Things Journal
Volume9
Issue number13
DOIs
Publication statusPublished - 1 Jul 2022
Externally publishedYes

Keywords

  • Deep reinforcement learning (DRL)
  • digital twin (DT)
  • flocking motion
  • multi-UAV systems

Fingerprint

Dive into the research topics of 'Deep Reinforcement Learning for Flocking Motion of Multi-UAV Systems: Learn From a Digital Twin'. Together they form a unique fingerprint.

Cite this