TY - GEN
T1 - Reinforcement Learning-based Trajectory Planning for Cooperative Source Seeking
AU - Wu, Weiran
AU - Li, Zhuo
AU - Sun, Jian
AU - Wang, Gang
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - This paper investigates the cooperative source seeking problem via a networked multi-vehicle system, and proposes a reinforcement learning (RL)-based trajectory planning scheme for the system. In contrast to most existing works, the source position of this work is determined by multiple types of signal fields, and each vehicle in the network is responsible to take measurements of a type of signal field. In another word, a single vehicle cannot localize the source position and must cooperate with its neighbors. To cooperatively localize and simultaneously reach the source position, the vehicle is equipped with a trajectory planning scheme that combines consensus algorithm for cooperative source seeking and an RL-based algorithm for maximizing the value of its corresponding signal field. Thus, each trajectory planner only requires measurements of a field and relative positions between neighboring vehicles, which is especially appealing to global position system (GPS)-denied environments. Simulations are provided to validate the efficacy of the proposed trajectory planning scheme. The RL-based trajectory planner actively leads the vehicle to positions with higher field measurements, without the need for gradient or absolute position information. And we demonstrate the successful deployment of the model learned in single vehicle settings to the cooperative source seeking task.
AB - This paper investigates the cooperative source seeking problem via a networked multi-vehicle system, and proposes a reinforcement learning (RL)-based trajectory planning scheme for the system. In contrast to most existing works, the source position of this work is determined by multiple types of signal fields, and each vehicle in the network is responsible to take measurements of a type of signal field. In another word, a single vehicle cannot localize the source position and must cooperate with its neighbors. To cooperatively localize and simultaneously reach the source position, the vehicle is equipped with a trajectory planning scheme that combines consensus algorithm for cooperative source seeking and an RL-based algorithm for maximizing the value of its corresponding signal field. Thus, each trajectory planner only requires measurements of a field and relative positions between neighboring vehicles, which is especially appealing to global position system (GPS)-denied environments. Simulations are provided to validate the efficacy of the proposed trajectory planning scheme. The RL-based trajectory planner actively leads the vehicle to positions with higher field measurements, without the need for gradient or absolute position information. And we demonstrate the successful deployment of the model learned in single vehicle settings to the cooperative source seeking task.
KW - consensus algorithm
KW - cooperative source seeking
KW - RL
KW - trajectory planning
UR - http://www.scopus.com/inward/record.url?scp=105003272511&partnerID=8YFLogxK
U2 - 10.1109/IARCE64300.2024.00022
DO - 10.1109/IARCE64300.2024.00022
M3 - Conference contribution
AN - SCOPUS:105003272511
T3 - Proceedings - 2024 4th International Conference on Industrial Automation, Robotics and Control Engineering, IARCE 2024
SP - 78
EP - 83
BT - Proceedings - 2024 4th International Conference on Industrial Automation, Robotics and Control Engineering, IARCE 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 4th International Conference on Industrial Automation, Robotics and Control Engineering, IARCE 2024
Y2 - 15 November 2024 through 17 November 2024
ER -