TY - JOUR
T1 - Spatiotemporal Relationship Cognitive Learning for Multirobot Air Combat
AU - Piao, Haiyin
AU - Han, Yue
AU - He, Shaoming
AU - Yu, Chao
AU - Fan, Songyuan
AU - Hou, Yaqing
AU - Bai, Chengchao
AU - Mo, Li
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023/12/1
Y1 - 2023/12/1
N2 - cognition is crucial to learning-based multirobot systems (MRSs). As an advanced application of MRSs for fierce confrontation, the relationships among autonomous air combat robots inherently present complex time-varying characteristics, which makes relationship cognition even more difficult. However, previous studies have only focused on spatial cooperative relationships, thus ignoring the potential impact of the temporal dynamics of relationships on long-term cooperative behaviors. To tackle this drawback, we propose a novel multiagent deep reinforcement learning (MADRL)-based autonomous air combat robots collaboration algorithm, called spatiotemporal aerial robots relationship co-optimization (STARCO). STARCO formulates the complex dynamic relationship cognition problem into a spatiotemporal deep graph neural network (GNN) learning problem. On this basis, we overcome the limitations of previous methods, by accurately capturing the key spatiotemporal patterns from aggressive air combat, and enable global collaborative decision making through joint strategy optimization. An empirical study shows that STARCO outperforms several state-of-the-art MARL baselines by 24.6% in learning performance. We also demonstrate that STARCO is capable of evolving various cooperative strategies comparable to human expert knowledge.
AB - cognition is crucial to learning-based multirobot systems (MRSs). As an advanced application of MRSs for fierce confrontation, the relationships among autonomous air combat robots inherently present complex time-varying characteristics, which makes relationship cognition even more difficult. However, previous studies have only focused on spatial cooperative relationships, thus ignoring the potential impact of the temporal dynamics of relationships on long-term cooperative behaviors. To tackle this drawback, we propose a novel multiagent deep reinforcement learning (MADRL)-based autonomous air combat robots collaboration algorithm, called spatiotemporal aerial robots relationship co-optimization (STARCO). STARCO formulates the complex dynamic relationship cognition problem into a spatiotemporal deep graph neural network (GNN) learning problem. On this basis, we overcome the limitations of previous methods, by accurately capturing the key spatiotemporal patterns from aggressive air combat, and enable global collaborative decision making through joint strategy optimization. An empirical study shows that STARCO outperforms several state-of-the-art MARL baselines by 24.6% in learning performance. We also demonstrate that STARCO is capable of evolving various cooperative strategies comparable to human expert knowledge.
KW - Air combat
KW - graph neural network (GNN)
KW - multiagent deep reinforcement learning (MADRL)
KW - relationship robot
KW - spatiotemporal
UR - http://www.scopus.com/inward/record.url?scp=85149366605&partnerID=8YFLogxK
U2 - 10.1109/TCDS.2023.3250819
DO - 10.1109/TCDS.2023.3250819
M3 - Article
AN - SCOPUS:85149366605
SN - 2379-8920
VL - 15
SP - 2254
EP - 2268
JO - IEEE Transactions on Cognitive and Developmental Systems
JF - IEEE Transactions on Cognitive and Developmental Systems
IS - 4
ER -