多节点探测器软着陆的路径规划方法

Xin Wang; Qing Jie Zhao; Chong Chong Yu; Chang Chun Zhang; Yong Quan Chen

doi:10.3873/j.issn.1000-1328.2022.03.012

多节点探测器软着陆的路径规划方法

Translated title of the contribution: Path Planning Method of Soft Landing for Multi-Node Probe

Xin Wang, Qing Jie Zhao^*, Chong Chong Yu, Chang Chun Zhang, Yong Quan Chen

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

The deep space probe with flexible-connected multiple nodes is probably a solution to the possible overturn or rebound in single-node probe landing on an asteroid. Therefore, we construct a probe with flexible-connected three nodes, model the soft landing process, and propose a multi-task deep reinforcement learning method with self-attention mechanism. Each node's state is described referring to the probe base. Furthermore, joint learning among nodes is used to improve their adaptability. At the same time, the self-attention is applied to make the nodes focus on their own tasks and learn better strategies to obtain higher rewards for feature extraction of the probe and obstacles. Experimental results show that the method proposed in this paper is more beneficial to the stable landing of the probe compared with other methods.

Translated title of the contribution	Path Planning Method of Soft Landing for Multi-Node Probe
Original language	Chinese (Traditional)
Pages (from-to)	366-373
Number of pages	8
Journal	Yuhang Xuebao/Journal of Astronautics
Volume	43
Issue number	3
DOIs	https://doi.org/10.3873/j.issn.1000-1328.2022.03.012
Publication status	Published - 30 Mar 2022

Access to Document

10.3873/j.issn.1000-1328.2022.03.012

Cite this

@article{c08d7ea9812442e09b367d4eb4d517cb,

title = "多节点探测器软着陆的路径规划方法",

abstract = "The deep space probe with flexible-connected multiple nodes is probably a solution to the possible overturn or rebound in single-node probe landing on an asteroid. Therefore, we construct a probe with flexible-connected three nodes, model the soft landing process, and propose a multi-task deep reinforcement learning method with self-attention mechanism. Each node's state is described referring to the probe base. Furthermore, joint learning among nodes is used to improve their adaptability. At the same time, the self-attention is applied to make the nodes focus on their own tasks and learn better strategies to obtain higher rewards for feature extraction of the probe and obstacles. Experimental results show that the method proposed in this paper is more beneficial to the stable landing of the probe compared with other methods.",

keywords = "Deep reinforcement learning, Deep space probe, Multi-task learning, Self-attention mechanism, Soft landing",

author = "Xin Wang and Zhao, {Qing Jie} and Yu, {Chong Chong} and Zhang, {Chang Chun} and Chen, {Yong Quan}",

year = "2022",

month = mar,

day = "30",

doi = "10.3873/j.issn.1000-1328.2022.03.012",

language = "繁体中文",

volume = "43",

pages = "366--373",

journal = "Yuhang Xuebao/Journal of Astronautics",

issn = "1000-1328",

publisher = "Chinese Society of Astronautics",

number = "3",

}

TY - JOUR

T1 - 多节点探测器软着陆的路径规划方法

AU - Wang, Xin

AU - Zhao, Qing Jie

AU - Yu, Chong Chong

AU - Zhang, Chang Chun

AU - Chen, Yong Quan

PY - 2022/3/30

Y1 - 2022/3/30

N2 - The deep space probe with flexible-connected multiple nodes is probably a solution to the possible overturn or rebound in single-node probe landing on an asteroid. Therefore, we construct a probe with flexible-connected three nodes, model the soft landing process, and propose a multi-task deep reinforcement learning method with self-attention mechanism. Each node's state is described referring to the probe base. Furthermore, joint learning among nodes is used to improve their adaptability. At the same time, the self-attention is applied to make the nodes focus on their own tasks and learn better strategies to obtain higher rewards for feature extraction of the probe and obstacles. Experimental results show that the method proposed in this paper is more beneficial to the stable landing of the probe compared with other methods.

AB - The deep space probe with flexible-connected multiple nodes is probably a solution to the possible overturn or rebound in single-node probe landing on an asteroid. Therefore, we construct a probe with flexible-connected three nodes, model the soft landing process, and propose a multi-task deep reinforcement learning method with self-attention mechanism. Each node's state is described referring to the probe base. Furthermore, joint learning among nodes is used to improve their adaptability. At the same time, the self-attention is applied to make the nodes focus on their own tasks and learn better strategies to obtain higher rewards for feature extraction of the probe and obstacles. Experimental results show that the method proposed in this paper is more beneficial to the stable landing of the probe compared with other methods.

KW - Deep reinforcement learning

KW - Deep space probe

KW - Multi-task learning

KW - Self-attention mechanism

KW - Soft landing

UR - http://www.scopus.com/inward/record.url?scp=85130183570&partnerID=8YFLogxK

U2 - 10.3873/j.issn.1000-1328.2022.03.012

DO - 10.3873/j.issn.1000-1328.2022.03.012

M3 - 文章

AN - SCOPUS:85130183570

SN - 1000-1328

VL - 43

SP - 366

EP - 373

JO - Yuhang Xuebao/Journal of Astronautics

JF - Yuhang Xuebao/Journal of Astronautics

IS - 3

ER -

多节点探测器软着陆的路径规划方法

Abstract

Access to Document

Other files and links

Fingerprint

Cite this