TY - GEN
T1 - Explainable Autonomous Maneuver-Penetration Strategy for UAVs via Continuous-Action Learning Automata
AU - Qi, Puyang
AU - Liu, Yangxin
AU - Li, Yiheng
AU - Pan, Xiaochun
AU - Xia, Qunli
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Modern air defense systems are becoming faster, more complex, and agile, rendering static evasive maneuvers ineffective for high-speed or hypersonic UAVs. This study proposes a lightweight, explainable Continuous-Action Learning Automata (CARLA) framework as an alternative to black-box DRL policies, reformulating UAV penetration as a probabilistic decision problem in human-interpretable maneuver primitives. Six primitives - straight acceleration, U-turn, S-turn, square wave, barrel roll, and inclined plane - are discretized with trigger range and commanded overload into a 3D action grid. Offline training simulates thousands of random 3-DOF engagement scenarios, evaluates sampled actions via Monte Carlo, and updates joint probability density using decaying Gaussian kernels for exploration-to-exploitation shift, converging to a peaked lookup table that prunes suboptimal tactics. Evaluation on 1,000 independent cases yields 88.8% penetration success - 50.3 points higher than random sampling - with barrel-roll dominating selections. Onboard execution via table lookup and scaling achieves sub-millisecond latency and full transparency for certification. The framework combines RL adaptability with rule-based interpretability for practical, real-time UAV penetration.
AB - Modern air defense systems are becoming faster, more complex, and agile, rendering static evasive maneuvers ineffective for high-speed or hypersonic UAVs. This study proposes a lightweight, explainable Continuous-Action Learning Automata (CARLA) framework as an alternative to black-box DRL policies, reformulating UAV penetration as a probabilistic decision problem in human-interpretable maneuver primitives. Six primitives - straight acceleration, U-turn, S-turn, square wave, barrel roll, and inclined plane - are discretized with trigger range and commanded overload into a 3D action grid. Offline training simulates thousands of random 3-DOF engagement scenarios, evaluates sampled actions via Monte Carlo, and updates joint probability density using decaying Gaussian kernels for exploration-to-exploitation shift, converging to a peaked lookup table that prunes suboptimal tactics. Evaluation on 1,000 independent cases yields 88.8% penetration success - 50.3 points higher than random sampling - with barrel-roll dominating selections. Onboard execution via table lookup and scaling achieves sub-millisecond latency and full transparency for certification. The framework combines RL adaptability with rule-based interpretability for practical, real-time UAV penetration.
KW - continuous action learning automata (CARLA)
KW - explainable reinforcement learning
KW - maneuver primitive library
KW - penetration strategy
UR - https://www.scopus.com/pages/publications/105031875271
U2 - 10.1109/ICUS66297.2025.11294221
DO - 10.1109/ICUS66297.2025.11294221
M3 - Conference contribution
AN - SCOPUS:105031875271
T3 - Proceedings of 2025 IEEE International Conference on Unmanned Systems, ICUS 2025
SP - 1730
EP - 1738
BT - Proceedings of 2025 IEEE International Conference on Unmanned Systems, ICUS 2025
A2 - Song, Rong
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2025 IEEE International Conference on Unmanned Systems, ICUS 2025
Y2 - 18 September 2025 through 19 September 2025
ER -