Learning Smooth Motion Planning for Intelligent Aerial Transportation Vehicles by Stable Auxiliary Gradient

Haiyin Piao*, Jin Yu, Li Mo, Xin Yang*, Zhimin Liu, Zhixiao Sun, Ming Lu, Zhen Yang, Deyun Zhou

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)

Abstract

Deep Reinforcement Learning (DRL) has been widely attempted for solving real-time intelligent aerial transportation vehicle motion planning tasks recently. When interacting with environment, DRL-driven aerial vehicles inevitably switch the steering actions in high frequency during both exploration and execution phase, resulting in the well known flight trajectory oscillation issue, which makes flight dynamics unstable, and even endangers flight safety in serious cases. Unfortunately, there is hardly any literature about achieving flight trajectory smoothness in DRL-based motion planning. In view of this, we originally formalize the practical flight trajectory smoothen problem as a three-level Nested pArameterized Smooth Trajectory Optimization (NASTO) form. On this basis, a novel Stable Auxiliary Gradient (SAG) algorithm is proposed, which significantly smoothens the DRL-generated flight motions by constructing two independent optimization aspects: the major gradient, and the stable auxiliary gradient. Experimental result reveals that the proposed SAG algorithm outperforms baseline DRL-based intelligent aerial transportation vehicle motion planning algorithms in terms of both learning efficiency and flight motion smoothness.

Original languageEnglish
Pages (from-to)24464-24473
Number of pages10
JournalIEEE Transactions on Intelligent Transportation Systems
Volume23
Issue number12
DOIs
Publication statusPublished - 1 Dec 2022

Keywords

  • Motion planning
  • aerial
  • deep reinforcement learning (DRL)
  • intelligent
  • smooth
  • vehicle

Fingerprint

Dive into the research topics of 'Learning Smooth Motion Planning for Intelligent Aerial Transportation Vehicles by Stable Auxiliary Gradient'. Together they form a unique fingerprint.

Cite this