Self-play Decision-making Method of Deep Reinforcement Learning Guided by Behavior Tree under Complex Environment

Xiaochen Xiong*, Shuai Wang, Bo Wang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

With advances in artificial intelligence, military simulation has evolved from a human-to-human exercise to autonomous self-improvement through self-play of reinforcement learning. Unlike air and sea engagements, the complexity of terrain in land battles cannot be ignored. On the 2D map of the real world, we introduce elevation data through grid map to construct basic scenes and apply fuzzy theory to clarify the nuances of different terrain. Based on the latest unmanned combat vehicle data, the land combat agent model is designed, and the depth deterministic strategy gradient (DDPG) algorithm and near end strategy optimization (PPO) algorithm are adopted. Through self-play, the strategy of the vehicle is optimized so that it can effectively adapt to various battlefield scenarios. This method not only enhances the realism of simulation by combining key terrain features, but also significantly improves the strategy capability of autonomous agents. By constantly playing with themselves to perfect their tactics, these agents can better face the unpredictable dynamics of land warfare. Our results show that by incorporating training with DDPG and PPO algorithms guided by behavior trees, the ability of agents to navigate and engage in complex terrain is significantly improved, demonstrating the potential of AI-driven simulation to shape future military strategy and training.

Original languageEnglish
Title of host publicationProceedings of the 43rd Chinese Control Conference, CCC 2024
EditorsJing Na, Jian Sun
PublisherIEEE Computer Society
Pages3988-3993
Number of pages6
ISBN (Electronic)9789887581581
DOIs
Publication statusPublished - 2024
Event43rd Chinese Control Conference, CCC 2024 - Kunming, China
Duration: 28 Jul 202431 Jul 2024

Publication series

NameChinese Control Conference, CCC
ISSN (Print)1934-1768
ISSN (Electronic)2161-2927

Conference

Conference43rd Chinese Control Conference, CCC 2024
Country/TerritoryChina
CityKunming
Period28/07/2431/07/24

Keywords

  • behavior trees
  • deep reinforcement learning
  • fuzzy theory
  • self-play

Fingerprint

Dive into the research topics of 'Self-play Decision-making Method of Deep Reinforcement Learning Guided by Behavior Tree under Complex Environment'. Together they form a unique fingerprint.

Cite this