Reinforcement Learning-Based Optimal Formation Tracking for UAVs With Safety Constraints

Research output: Contribution to journalArticlepeer-review

Abstract

This article develops a scheme to tackle the safe optimal formation tracking issue for multiple fixed-wing uncrewed aerial vehicles (UAVs) with external disturbances and asymmetric control constraints. To ensure safety constraints in collision avoidance, a safe set is first constructed by a super level set of a continuously differential function, following a novel control barrier function (CBF) to characterize the safety. Subsequently, we transform the safe optimal formation tracking control into a constrained zero-sum (ZS) differential game to mitigate the destabilizing effects of the disturbances, where the cost function is constructed in a nonquadratic form to cope with asymmetric input constraints. Particularly, the designed CBF is integrated into the cost function to penalize the unsafe behavior, and a damping coefficient is included to balance the optimality and safety. Afterwords, a critic-only reinforcement learning (RL) strategy is developed to learn the robust safe Nash policy, where the critic weights are updated by applying experience replay technology, thus avoiding the requirement for persistence of excitation condition. Moreover, the stability and forward invariance of the safe set of the presented scheme are also verified. Finally, simulation examples are provided to substantiate the validity of the control scheme.

Original languageEnglish
JournalIEEE Transactions on Neural Networks and Learning Systems
DOIs
Publication statusAccepted/In press - 2026

Keywords

  • Control barrier function (CBF)
  • control constraints
  • formation tracking
  • safe reinforcement learning (RL)
  • uncrewed aerial vehicle (UAV)

Fingerprint

Dive into the research topics of 'Reinforcement Learning-Based Optimal Formation Tracking for UAVs With Safety Constraints'. Together they form a unique fingerprint.

Cite this