TY - GEN
T1 - Trajectory Generation by Sparse Demonstration Learning and Minimum Snap-based Optimization
AU - Xu, Taoying
AU - She, Haoping
AU - Si, Weiyong
AU - Li, Chuanjun
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - In this paper, dynamic time warping function is used to establish an optimal control system for four-rotor unmanned aerial vehicle (UAV) to learn how to optimize trajectory planning from sparse demonstration. By continuous Pontryagin Differentiable Programming, UAV learns the objective function based on sparse waypoints demonstration. However, due to the small sample data of sparse demonstration learning, there is a problem of low precision, and Pontryagin's Minimum Principle itself has the limitation of easily falling into the local optimal solution. So, this paper adopts the Minimum Snap trajectory algorithm that meets the dynamic constraints of the agent to generate a planned trajectory, to weighted combination with learning trajectory solved based on continuous Pontryagin Differentiable Programming, and the resulting optimized trajectory has the advantages of small demonstration learning difference loss, reasonable time allocation and reasonable planning, so that UAV can have certain generalization capability and optimize a reasonable trajectory with less energy loss. Finally, the feasibility of the proposed method is verified by the simulation experiment of the four-rotor UAV.
AB - In this paper, dynamic time warping function is used to establish an optimal control system for four-rotor unmanned aerial vehicle (UAV) to learn how to optimize trajectory planning from sparse demonstration. By continuous Pontryagin Differentiable Programming, UAV learns the objective function based on sparse waypoints demonstration. However, due to the small sample data of sparse demonstration learning, there is a problem of low precision, and Pontryagin's Minimum Principle itself has the limitation of easily falling into the local optimal solution. So, this paper adopts the Minimum Snap trajectory algorithm that meets the dynamic constraints of the agent to generate a planned trajectory, to weighted combination with learning trajectory solved based on continuous Pontryagin Differentiable Programming, and the resulting optimized trajectory has the advantages of small demonstration learning difference loss, reasonable time allocation and reasonable planning, so that UAV can have certain generalization capability and optimize a reasonable trajectory with less energy loss. Finally, the feasibility of the proposed method is verified by the simulation experiment of the four-rotor UAV.
KW - continuous Pontryagin Differentiable Programming
KW - dynamic time warping
KW - minimum snap
KW - sparse demonstration learning
KW - trajectory planning
UR - http://www.scopus.com/inward/record.url?scp=85208643081&partnerID=8YFLogxK
U2 - 10.1109/ICAC61394.2024.10718821
DO - 10.1109/ICAC61394.2024.10718821
M3 - Conference contribution
AN - SCOPUS:85208643081
T3 - ICAC 2024 - 29th International Conference on Automation and Computing
BT - ICAC 2024 - 29th International Conference on Automation and Computing
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 29th International Conference on Automation and Computing, ICAC 2024
Y2 - 28 August 2024 through 30 August 2024
ER -