Multi-Vehicle Cooperative Persistent Coverage for Random Target Search

Zhuo Li, Guangzheng Li, Alireza Sadeghi, Jian Sun, Gang Wang, Jialin Wang*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

This letter investigates the target search problem for a network of autonomous vehicles, aiming to maximize the detection of randomly appearing targets within a given area. Considering no prior knowledge of the targets is available, we propose a multi-vehicle cooperative persistent coverage scheme under the framework of multi-agent reinforcement learning, in contrast to heuristic and model-based optimization methods in existing works. We model the persistent coverage problem as a partially observable Markov decision process (POMDP) due to the vehicles' limited observation ranges, and introduce a knowability map to characterize their knowledge of the target area. Each vehicle employs a distributed estimator, leveraging its own observations and shared information from neighboring vehicles, to construct a globally estimated knowability map - thereby mitigating partial observability. The persistent coverage policies are learned with the architecture of centralized training and distributed execution, enabling cooperative and efficient target search by fully exploiting shared information. Moreover, we propose an adaptive partition method for the target area to ensure a fixed dimension of the state space in the POMDP, which can improve scalability of the learned policy to target areas with various sizes. Simulations validate effectiveness and scalability of the proposed cooperative scheme.

Original languageEnglish
Pages (from-to)6680-6687
Number of pages8
JournalIEEE Robotics and Automation Letters
Volume10
Issue number7
DOIs
Publication statusPublished - 2025

Keywords

  • Cooperative search
  • multi-vehicle system
  • persistent coverage
  • reinforcement learning

Fingerprint

Dive into the research topics of 'Multi-Vehicle Cooperative Persistent Coverage for Random Target Search'. Together they form a unique fingerprint.

Cite this