Releasing source locating based on Multi-Agent Reinforcement Learning with reward function designed by maximum entropy

Zhi Pu Wang, Guang Rong Zeng, Lie Wei Deng, Wang Cao, Yao Guo

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

This paper is focused on locating the actual releasing source in the environment of multiple disturbance sources. The actual releasing source is located with multiple mobile sensors. In an attempt to avoid mobile sensors falling into the disturbance releasing source and gather at the actual releasing source quickly, an improved Multi-Agent Reinforcement Learning (MARL) with novel designed reward function is applied to guide the movement of mobile sensors. To ensure finding the actual releasing source with maximum releasing concentration, the reward function is designed based on maximum entropy (ME). Finally, MARL with reward function designed by ME and normal MARL are simulated and compared to verify the efficiency and advantage of this method.

源语言英语
主期刊名Proceedings of the 41st Chinese Control Conference, CCC 2022
编辑Zhijun Li, Jian Sun
出版商IEEE Computer Society
4688-4693
页数6
ISBN(电子版)9789887581536
DOI
出版状态已出版 - 2022
已对外发布
活动41st Chinese Control Conference, CCC 2022 - Hefei, 中国
期限: 25 7月 202227 7月 2022

出版系列

姓名Chinese Control Conference, CCC
2022-July
ISSN(印刷版)1934-1768
ISSN(电子版)2161-2927

会议

会议41st Chinese Control Conference, CCC 2022
国家/地区中国
Hefei
时期25/07/2227/07/22

指纹

探究 'Releasing source locating based on Multi-Agent Reinforcement Learning with reward function designed by maximum entropy' 的科研主题。它们共同构成独一无二的指纹。

引用此