Hierarchical path planner combining probabilistic roadmap and deep deterministic policy gradient for unmanned ground vehicles with non-holonomic constraints

Jie Fan; Xudong Zhang; Kun Zheng; Yuan Zou; Nana Zhou

doi:10.1016/j.jfranklin.2024.106821

Hierarchical path planner combining probabilistic roadmap and deep deterministic policy gradient for unmanned ground vehicles with non-holonomic constraints

Jie Fan, Xudong Zhang^*, Kun Zheng, Yuan Zou, Nana Zhou

^*Corresponding author for this work

School of Mechanical Engineering

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

Path planning plays a vital role in autonomous driving as it needs to guide the vehicle to achieve the target position without collision while satisfying the vehicle's non-holonomic constraints. A good path planner should be able to cope with sophisticated environments and try to make the planned path as short as possible. Traditional path planning methods when being applied on four-wheel vehicles may suffer from one of the following disadvantages including conflicting with non-holonomic constraints, far from close-optimal length solution, or easy to get stuck in specific environments. In this paper, a hierarchical path planning method for a four-wheel vehicle combining probabilistic roadmap (PRM) and deep deterministic policy gradient (DDPG) is proposed to address existing problems. In the upper level, PRM is used to generate a guidance path quickly, which doesn't consider the vehicle's non-holonomic constraints but gives the sketch of the path that helps the vehicle to jump out of some dilemmas in specific environments while maintaining a relatively short path distance. In the lower level, reinforcement learning, specifically DDPG, is used to optimize the guidance path generated by PRM considering multiple factors including non-holonomic constraints and obstacle avoidance. The proposed method is validated in four different environments including a simple map, an easy-to-stuck specific map, a maze-like complex map and an office-like complex map. Results show that the proposed method could generate a smooth path satisfying non-holonomic constraints while demonstrating desirable obstacle avoidance performance. The planned path of the proposed method is superior to that of both traditional and state-of-the-art methods in terms of path length and smoothness, which could be 1.3–12.9 % shorter and 2.6–52.1 % smoother in different environments.

Original language	English
Article number	106821
Journal	Journal of the Franklin Institute
Volume	361
Issue number	8
DOIs	https://doi.org/10.1016/j.jfranklin.2024.106821
Publication status	Published - May 2024

Access to Document

10.1016/j.jfranklin.2024.106821

Cite this

@article{8760c78a568a48f58b7f34b589370130,

title = "Hierarchical path planner combining probabilistic roadmap and deep deterministic policy gradient for unmanned ground vehicles with non-holonomic constraints",

abstract = "Path planning plays a vital role in autonomous driving as it needs to guide the vehicle to achieve the target position without collision while satisfying the vehicle's non-holonomic constraints. A good path planner should be able to cope with sophisticated environments and try to make the planned path as short as possible. Traditional path planning methods when being applied on four-wheel vehicles may suffer from one of the following disadvantages including conflicting with non-holonomic constraints, far from close-optimal length solution, or easy to get stuck in specific environments. In this paper, a hierarchical path planning method for a four-wheel vehicle combining probabilistic roadmap (PRM) and deep deterministic policy gradient (DDPG) is proposed to address existing problems. In the upper level, PRM is used to generate a guidance path quickly, which doesn't consider the vehicle's non-holonomic constraints but gives the sketch of the path that helps the vehicle to jump out of some dilemmas in specific environments while maintaining a relatively short path distance. In the lower level, reinforcement learning, specifically DDPG, is used to optimize the guidance path generated by PRM considering multiple factors including non-holonomic constraints and obstacle avoidance. The proposed method is validated in four different environments including a simple map, an easy-to-stuck specific map, a maze-like complex map and an office-like complex map. Results show that the proposed method could generate a smooth path satisfying non-holonomic constraints while demonstrating desirable obstacle avoidance performance. The planned path of the proposed method is superior to that of both traditional and state-of-the-art methods in terms of path length and smoothness, which could be 1.3–12.9 % shorter and 2.6–52.1 % smoother in different environments.",

author = "Jie Fan and Xudong Zhang and Kun Zheng and Yuan Zou and Nana Zhou",

note = "Publisher Copyright: {\textcopyright} 2024 The Franklin Institute",

year = "2024",

month = may,

doi = "10.1016/j.jfranklin.2024.106821",

language = "English",

volume = "361",

journal = "Journal of the Franklin Institute",

issn = "0016-0032",

publisher = "Elsevier Ltd.",

number = "8",

}

TY - JOUR

T1 - Hierarchical path planner combining probabilistic roadmap and deep deterministic policy gradient for unmanned ground vehicles with non-holonomic constraints

AU - Fan, Jie

AU - Zhang, Xudong

AU - Zheng, Kun

AU - Zou, Yuan

AU - Zhou, Nana

PY - 2024/5

Y1 - 2024/5

N2 - Path planning plays a vital role in autonomous driving as it needs to guide the vehicle to achieve the target position without collision while satisfying the vehicle's non-holonomic constraints. A good path planner should be able to cope with sophisticated environments and try to make the planned path as short as possible. Traditional path planning methods when being applied on four-wheel vehicles may suffer from one of the following disadvantages including conflicting with non-holonomic constraints, far from close-optimal length solution, or easy to get stuck in specific environments. In this paper, a hierarchical path planning method for a four-wheel vehicle combining probabilistic roadmap (PRM) and deep deterministic policy gradient (DDPG) is proposed to address existing problems. In the upper level, PRM is used to generate a guidance path quickly, which doesn't consider the vehicle's non-holonomic constraints but gives the sketch of the path that helps the vehicle to jump out of some dilemmas in specific environments while maintaining a relatively short path distance. In the lower level, reinforcement learning, specifically DDPG, is used to optimize the guidance path generated by PRM considering multiple factors including non-holonomic constraints and obstacle avoidance. The proposed method is validated in four different environments including a simple map, an easy-to-stuck specific map, a maze-like complex map and an office-like complex map. Results show that the proposed method could generate a smooth path satisfying non-holonomic constraints while demonstrating desirable obstacle avoidance performance. The planned path of the proposed method is superior to that of both traditional and state-of-the-art methods in terms of path length and smoothness, which could be 1.3–12.9 % shorter and 2.6–52.1 % smoother in different environments.

AB - Path planning plays a vital role in autonomous driving as it needs to guide the vehicle to achieve the target position without collision while satisfying the vehicle's non-holonomic constraints. A good path planner should be able to cope with sophisticated environments and try to make the planned path as short as possible. Traditional path planning methods when being applied on four-wheel vehicles may suffer from one of the following disadvantages including conflicting with non-holonomic constraints, far from close-optimal length solution, or easy to get stuck in specific environments. In this paper, a hierarchical path planning method for a four-wheel vehicle combining probabilistic roadmap (PRM) and deep deterministic policy gradient (DDPG) is proposed to address existing problems. In the upper level, PRM is used to generate a guidance path quickly, which doesn't consider the vehicle's non-holonomic constraints but gives the sketch of the path that helps the vehicle to jump out of some dilemmas in specific environments while maintaining a relatively short path distance. In the lower level, reinforcement learning, specifically DDPG, is used to optimize the guidance path generated by PRM considering multiple factors including non-holonomic constraints and obstacle avoidance. The proposed method is validated in four different environments including a simple map, an easy-to-stuck specific map, a maze-like complex map and an office-like complex map. Results show that the proposed method could generate a smooth path satisfying non-holonomic constraints while demonstrating desirable obstacle avoidance performance. The planned path of the proposed method is superior to that of both traditional and state-of-the-art methods in terms of path length and smoothness, which could be 1.3–12.9 % shorter and 2.6–52.1 % smoother in different environments.

UR - http://www.scopus.com/inward/record.url?scp=85190826452&partnerID=8YFLogxK

U2 - 10.1016/j.jfranklin.2024.106821

DO - 10.1016/j.jfranklin.2024.106821

M3 - Article

AN - SCOPUS:85190826452

SN - 0016-0032

VL - 361

JO - Journal of the Franklin Institute

JF - Journal of the Franklin Institute

IS - 8

M1 - 106821

ER -

Hierarchical path planner combining probabilistic roadmap and deep deterministic policy gradient for unmanned ground vehicles with non-holonomic constraints

Abstract

Access to Document

Other files and links

Fingerprint

Cite this