Research on Multi-Agent Task Allocation and Path Planning Based on Pri-MADDPG

Zhiwen Wang, Bo Wang, Xiao He, Qing Fei

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper, we aim to develop a reinforcement learning (RL) based algorithm for the task allocation and path planning problem of multi-agent systems where all agents autonomously head to task points with obstacle avoidance. To address the challenge of slow convergence speed and insufficient reward setting when using traditional RL methods, the named Pri-MADDPG algorithm based on prioritized experience replay is proposed. By integrating task allocation and path planning problem, we first construct a framework for multi-agent reinforcement learning training by designing essential elements including appropriate observation space, action space, and reward functions. Then a prioritized experience replay method, in which the value network loss is employed for the priority evaluation, is utilized to enhance policy learning performance. A reward mechanism is further improved through taking into consideration of both global task objectives and individual objectives. To verify the effectiveness of Pri-MADDPG algorithm, experiments are finally carried out with the well-designed reward mechanism. The results demonstrate that all agents can autonomously accomplish task allocation with smooth and highly safe trajectories while achieving faster convergence speed, better stability, and superior performance.

Original languageEnglish
Title of host publicationProceedings - 2023 China Automation Congress, CAC 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages6569-6574
Number of pages6
ISBN (Electronic)9798350303759
DOIs
Publication statusPublished - 2023
Externally publishedYes
Event2023 China Automation Congress, CAC 2023 - Chongqing, China
Duration: 17 Nov 202319 Nov 2023

Publication series

NameProceedings - 2023 China Automation Congress, CAC 2023

Conference

Conference2023 China Automation Congress, CAC 2023
Country/TerritoryChina
CityChongqing
Period17/11/2319/11/23

Keywords

  • path planning
  • prioritized experience replay
  • reinforcement learning
  • task allocation

Fingerprint

Dive into the research topics of 'Research on Multi-Agent Task Allocation and Path Planning Based on Pri-MADDPG'. Together they form a unique fingerprint.

Cite this