Q-learning algorithm for path-planning to maneuver through a satellite cluster

Xiaoyu Chu, Kyle T. Alfriend, Jingrui Zhang, Yao Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Citations (Scopus)

Abstract

In this paper, a path planning method for maneuvering through a satellite cluster using Q-learning is presented. An on-orbit servicing spacecraft is supposed to rendezvous with the failed central satellite of a formation and avoid collisions with the other satellites. The dynamic model of the satellite cluster is first established by Lawden equations. Then the theory of Q-learning is introduced and the reward shaping is specified to guide the learning system quickly to success. Furthermore, combining Q-learning with deep neural networks, deep Q-network (DQN) is employed when the dimension of the problem is enormous. Finally, the rendezvous mission is simulated in 2D and 3D scenarios separately to demonstrate the effectiveness of the proposed method.

Original languageEnglish
Title of host publicationAAS/AIAA Astrodynamics Specialist Conference, 2018
EditorsPuneet Singla, Ryan M. Weisman, Belinda G. Marchand, Brandon A. Jones
PublisherUnivelt Inc.
Pages2063-2082
Number of pages20
ISBN (Print)9780877036579
Publication statusPublished - 2018
EventAAS/AIAA Astrodynamics Specialist Conference, 2018 - Snowbird, United States
Duration: 19 Aug 201823 Aug 2018

Publication series

NameAdvances in the Astronautical Sciences
Volume167
ISSN (Print)0065-3438

Conference

ConferenceAAS/AIAA Astrodynamics Specialist Conference, 2018
Country/TerritoryUnited States
CitySnowbird
Period19/08/1823/08/18

Fingerprint

Dive into the research topics of 'Q-learning algorithm for path-planning to maneuver through a satellite cluster'. Together they form a unique fingerprint.

Cite this