TY - JOUR
T1 - Multiagent reinforcement learning with evolution for multitarget tracking by unmanned aerial vehicle swarm
AU - Jiao, Keming
AU - Chen, Jie
AU - Xin, Bin
AU - Li, Li
AU - Ding, Yulong
AU - Zhao, Zhixin
AU - Zheng, Yifan
N1 - Publisher Copyright:
© 2025 Elsevier B.V.
PY - 2025/9
Y1 - 2025/9
N2 - Multitarget tracking has immense potential in both military and civilian applications. For an unmanned aerial vehicle (UAV) swarm, a critical challenge in multitarget tracking is how to coordinate multiple unmanned aerial vehicles to continuously and accurately track multiple targets. This paper considers the cooperative decision-making problem for multitarget tracking by multiple unmanned aerial vehicles with limited sensing range in a dynamic environment. Meanwhile, a multiagent advantage actor critic with evolution, named MAA2CE, is proposed to learn the cooperative tracking policies for a UAV swarm. During training, each UAV is viewed as an agent with a policy network, making decisions based on its own information and policy. The prioritized experience replay is adopted to take full advantage of the valuable experience for learning. Considering the tracking performance of agents, the high performance agent can replicate its own network parameters to the other agents with a certain probability by network evolution. The experimental results demonstrate that the proposed algorithm is superior to three peer algorithms in learning efficiency, can acquire better collaborative tracking policies, and significantly improves the collaborative multitarget tracking proficiency of the UAV swarm.
AB - Multitarget tracking has immense potential in both military and civilian applications. For an unmanned aerial vehicle (UAV) swarm, a critical challenge in multitarget tracking is how to coordinate multiple unmanned aerial vehicles to continuously and accurately track multiple targets. This paper considers the cooperative decision-making problem for multitarget tracking by multiple unmanned aerial vehicles with limited sensing range in a dynamic environment. Meanwhile, a multiagent advantage actor critic with evolution, named MAA2CE, is proposed to learn the cooperative tracking policies for a UAV swarm. During training, each UAV is viewed as an agent with a policy network, making decisions based on its own information and policy. The prioritized experience replay is adopted to take full advantage of the valuable experience for learning. Considering the tracking performance of agents, the high performance agent can replicate its own network parameters to the other agents with a certain probability by network evolution. The experimental results demonstrate that the proposed algorithm is superior to three peer algorithms in learning efficiency, can acquire better collaborative tracking policies, and significantly improves the collaborative multitarget tracking proficiency of the UAV swarm.
KW - Multitarget tracking
KW - Reinforcement learning Evolution
KW - Unmanned aerial vehicle
UR - http://www.scopus.com/inward/record.url?scp=105008428979&partnerID=8YFLogxK
U2 - 10.1016/j.asoc.2025.113463
DO - 10.1016/j.asoc.2025.113463
M3 - Article
AN - SCOPUS:105008428979
SN - 1568-4946
VL - 181
JO - Applied Soft Computing
JF - Applied Soft Computing
M1 - 113463
ER -