TY - JOUR
T1 - Hierarchical Learning with Heuristic Guidance for Multi-task Assignment and Distributed Planning in Interactive Scenarios
AU - Chen, Siyuan
AU - Wang, Meiling
AU - Song, Wenjie
N1 - Publisher Copyright:
IEEE
PY - 2024
Y1 - 2024
N2 - A unified framework for multi-agent task assignment and distributed trajectory planning that can autonomously adapt to complex interactive environments and multi-task constraints has always been the bottleneck of unmanned cluster task. In multi-agent systems such as smart parking lots and smart intersections, tasks are time-varying with dynamically interactive agents, which cause the challenges such as inefficient assignment, spatiotemporal conflict and poor adaptive ability. Focusing on them, this paper proposes a hierarchical framework that combines centralized task assignment module with a multi-agent reinforcement learning based distributed trajectory planning module, which has a good scalability and task adaptability. The benefit of the proposed hierarchical framework is that it utilizes behavioral heuristic of congestion level and planning cost during assignment to motivate the appropriate assignment, which achieve an organic integration of multiple objectives. In terms of spatiotemporal conflicts for multi-agent, a weighted network structure is designed to capture dynamic obstacle information while introducing conflict constraints for collision avoidance policy optimization. Furthermore, in order to cope with changing tasks, re-assignment and re-planning mechanisms are incorporated into the framework, as well as the graph encoding layer which is adaptive to the uncertain tasks. Extensive experiments are conducted in autonomous parking scenarios to validate the effectiveness of the approach in task assignment and path conflict mitigation. Compared with other state of art methods, the task assignment and overall success rates have increased by 12.1% and 6.8%, respectively, with negligible computing time.
AB - A unified framework for multi-agent task assignment and distributed trajectory planning that can autonomously adapt to complex interactive environments and multi-task constraints has always been the bottleneck of unmanned cluster task. In multi-agent systems such as smart parking lots and smart intersections, tasks are time-varying with dynamically interactive agents, which cause the challenges such as inefficient assignment, spatiotemporal conflict and poor adaptive ability. Focusing on them, this paper proposes a hierarchical framework that combines centralized task assignment module with a multi-agent reinforcement learning based distributed trajectory planning module, which has a good scalability and task adaptability. The benefit of the proposed hierarchical framework is that it utilizes behavioral heuristic of congestion level and planning cost during assignment to motivate the appropriate assignment, which achieve an organic integration of multiple objectives. In terms of spatiotemporal conflicts for multi-agent, a weighted network structure is designed to capture dynamic obstacle information while introducing conflict constraints for collision avoidance policy optimization. Furthermore, in order to cope with changing tasks, re-assignment and re-planning mechanisms are incorporated into the framework, as well as the graph encoding layer which is adaptive to the uncertain tasks. Extensive experiments are conducted in autonomous parking scenarios to validate the effectiveness of the approach in task assignment and path conflict mitigation. Compared with other state of art methods, the task assignment and overall success rates have increased by 12.1% and 6.8%, respectively, with negligible computing time.
KW - Collision Avoidance
KW - Multi-agent Reinforcement Learning (MARL)
KW - Navigation
KW - Planning
KW - Reinforcement learning
KW - Task Assignment
KW - Task analysis
KW - Trajectory
KW - Trajectory planning
KW - Vehicle dynamics
UR - http://www.scopus.com/inward/record.url?scp=85186995467&partnerID=8YFLogxK
U2 - 10.1109/TIV.2024.3369730
DO - 10.1109/TIV.2024.3369730
M3 - Article
AN - SCOPUS:85186995467
SN - 2379-8858
SP - 1
EP - 13
JO - IEEE Transactions on Intelligent Vehicles
JF - IEEE Transactions on Intelligent Vehicles
ER -