Research on Multi-Agent Task Allocation and Path Planning Based on Pri-MADDPG

Zhiwen Wang; Bo Wang; Xiao He; Qing Fei

doi:10.1109/CAC59555.2023.10452082

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

In this paper, we develop a reinforcement learning (RL) based algorithm for the task allocation and path planning problem of multi-agent systems, in which all agents autonomously head to task points while avoiding obstacles. To address the slow convergence and inadequate reward design of traditional RL methods, we propose Pri-MADDPG, an algorithm based on prioritized experience replay. By integrating the task allocation and path planning problems, we first construct a multi-agent reinforcement learning training framework by designing its essential elements: the observation space, the action space, and the reward functions. A prioritized experience replay method, in which the value network loss serves as the priority measure, is then used to enhance policy learning performance. The reward mechanism is further improved by taking both global task objectives and individual objectives into consideration. To verify the effectiveness of the Pri-MADDPG algorithm, experiments are carried out with the designed reward mechanism. The results demonstrate that all agents can autonomously accomplish task allocation with smooth and highly safe trajectories, while the algorithm achieves faster convergence, better stability, and superior performance.
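
The abstract describes replay priorities derived from the value network loss. The following is a minimal Python sketch of that idea using the standard proportional prioritized-replay scheme; it is not the authors' implementation. The class name, the hyperparameters `alpha`, `beta`, and `eps`, and the importance-sampling weights are assumptions for illustration, since the record does not specify them.

```python
import random


class PrioritizedReplayBuffer:
    """Proportional prioritized replay (illustrative sketch, hypothetical names).

    Priorities are set from a per-sample critic (value network) loss.
    """

    def __init__(self, capacity, alpha=0.6, eps=1e-6, seed=None):
        self.capacity = capacity
        self.alpha = alpha  # assumed: how strongly priorities skew sampling (0 = uniform)
        self.eps = eps      # assumed: keeps every priority strictly positive
        self.data = []
        self.priorities = []
        self.pos = 0        # next slot to overwrite once the buffer is full
        self.rng = random.Random(seed)

    def add(self, transition):
        # New transitions receive the current maximum priority so that
        # they are likely to be replayed at least once.
        priority = max(self.priorities, default=1.0)
        if len(self.data) < self.capacity:
            self.data.append(transition)
            self.priorities.append(priority)
        else:
            self.data[self.pos] = transition
            self.priorities[self.pos] = priority
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size, beta=0.4):
        # Sampling probability is proportional to priority ** alpha.
        scaled = [p ** self.alpha for p in self.priorities]
        total = sum(scaled)
        probs = [s / total for s in scaled]
        n = len(self.data)
        indices = self.rng.choices(range(n), weights=probs, k=batch_size)
        # Importance-sampling weights offset the non-uniform sampling;
        # they are normalised by the largest weight in the batch.
        weights = [(n * probs[i]) ** (-beta) for i in indices]
        max_weight = max(weights)
        weights = [w / max_weight for w in weights]
        return indices, [self.data[i] for i in indices], weights

    def update_priorities(self, indices, critic_losses):
        # The per-sample critic loss becomes the new priority.
        for i, loss in zip(indices, critic_losses):
            self.priorities[i] = abs(loss) + self.eps
```

In a MADDPG-style training loop, each agent's critic would compute a per-sample loss on the sampled batch, scale it by the returned weights before backpropagation, and pass the unscaled losses to `update_priorities`. Transitions that the critic predicts poorly are then replayed more often.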

Original language	English
Title of host publication	Proceedings - 2023 China Automation Congress, CAC 2023
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	6569-6574
Number of pages	6
ISBN (Electronic)	9798350303759
DOIs	https://doi.org/10.1109/CAC59555.2023.10452082
Publication status	Published - 2023
Externally published	Yes
Event	2023 China Automation Congress, CAC 2023 - Chongqing, China
Duration	17 Nov 2023 → 19 Nov 2023

Publication series

Name	Proceedings - 2023 China Automation Congress, CAC 2023

Conference

Conference	2023 China Automation Congress, CAC 2023
Country/Territory	China
City	Chongqing
Period	17/11/23 → 19/11/23

Keywords

path planning
prioritized experience replay
reinforcement learning
task allocation

Cite this

@inproceedings{45bfc4230b3f42f7bb8202155696e2ed,
  title = "Research on Multi-Agent Task Allocation and Path Planning Based on Pri-MADDPG",
  abstract = "In this paper, we aim to develop a reinforcement learning (RL) based algorithm for the task allocation and path planning problem of multi-agent systems where all agents autonomously head to task points with obstacle avoidance. To address the challenge of slow convergence speed and insufficient reward setting when using traditional RL methods, the named Pri-MADDPG algorithm based on prioritized experience replay is proposed. By integrating task allocation and path planning problem, we first construct a framework for multi-agent reinforcement learning training by designing essential elements including appropriate observation space, action space, and reward functions. Then a prioritized experience replay method, in which the value network loss is employed for the priority evaluation, is utilized to enhance policy learning performance. A reward mechanism is further improved through taking into consideration of both global task objectives and individual objectives. To verify the effectiveness of Pri-MADDPG algorithm, experiments are finally carried out with the well-designed reward mechanism. The results demonstrate that all agents can autonomously accomplish task allocation with smooth and highly safe trajectories while achieving faster convergence speed, better stability, and superior performance.",
  keywords = "path planning, prioritized experience replay, reinforcement learning, task allocation",
  author = "Zhiwen Wang and Bo Wang and Xiao He and Qing Fei",
  note = "Publisher Copyright: {\textcopyright} 2023 IEEE.; 2023 China Automation Congress, CAC 2023 ; Conference date: 17-11-2023 Through 19-11-2023",
  year = "2023",
  doi = "10.1109/CAC59555.2023.10452082",
  language = "English",
  series = "Proceedings - 2023 China Automation Congress, CAC 2023",
  publisher = "Institute of Electrical and Electronics Engineers Inc.",
  pages = "6569--6574",
  booktitle = "Proceedings - 2023 China Automation Congress, CAC 2023",
  address = "United States",
}

Wang, Z, Wang, B, He, X & Fei, Q 2023, Research on Multi-Agent Task Allocation and Path Planning Based on Pri-MADDPG. in Proceedings - 2023 China Automation Congress, CAC 2023. Proceedings - 2023 China Automation Congress, CAC 2023, Institute of Electrical and Electronics Engineers Inc., pp. 6569-6574, 2023 China Automation Congress, CAC 2023, Chongqing, China, 17/11/23. https://doi.org/10.1109/CAC59555.2023.10452082

Research on Multi-Agent Task Allocation and Path Planning Based on Pri-MADDPG. / Wang, Zhiwen; Wang, Bo; He, Xiao et al.
Proceedings - 2023 China Automation Congress, CAC 2023. Institute of Electrical and Electronics Engineers Inc., 2023. p. 6569-6574 (Proceedings - 2023 China Automation Congress, CAC 2023).

TY - GEN

T1 - Research on Multi-Agent Task Allocation and Path Planning Based on Pri-MADDPG

AU - Wang, Zhiwen

AU - Wang, Bo

AU - He, Xiao

AU - Fei, Qing

PY - 2023

Y1 - 2023

AB - In this paper, we aim to develop a reinforcement learning (RL) based algorithm for the task allocation and path planning problem of multi-agent systems where all agents autonomously head to task points with obstacle avoidance. To address the challenge of slow convergence speed and insufficient reward setting when using traditional RL methods, the named Pri-MADDPG algorithm based on prioritized experience replay is proposed. By integrating task allocation and path planning problem, we first construct a framework for multi-agent reinforcement learning training by designing essential elements including appropriate observation space, action space, and reward functions. Then a prioritized experience replay method, in which the value network loss is employed for the priority evaluation, is utilized to enhance policy learning performance. A reward mechanism is further improved through taking into consideration of both global task objectives and individual objectives. To verify the effectiveness of Pri-MADDPG algorithm, experiments are finally carried out with the well-designed reward mechanism. The results demonstrate that all agents can autonomously accomplish task allocation with smooth and highly safe trajectories while achieving faster convergence speed, better stability, and superior performance.

KW - path planning

KW - prioritized experience replay

KW - reinforcement learning

KW - task allocation

UR - http://www.scopus.com/inward/record.url?scp=85189372716&partnerID=8YFLogxK

U2 - 10.1109/CAC59555.2023.10452082

DO - 10.1109/CAC59555.2023.10452082

M3 - Conference contribution

AN - SCOPUS:85189372716

T3 - Proceedings - 2023 China Automation Congress, CAC 2023

SP - 6569

EP - 6574

BT - Proceedings - 2023 China Automation Congress, CAC 2023

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2023 China Automation Congress, CAC 2023

Y2 - 17 November 2023 through 19 November 2023

ER -
