Research on UAV Coverage Search Based on DDQN in Unknown Environments

Gaofeng Deng; Xiaolan Yao; Bo Wang; Xiao He; Qing Fei

doi:10.1109/CAC59555.2023.10451197

School of Automation

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Peer-reviewed

Abstract

The utilization of unmanned aerial vehicle (UAV) for area coverage search is highly sought after in both military and civil domains, including but not limited to traversal search, mission reconnaissance, patrol detection, wildfire suppression control, remote sensing mapping, agricultural preservation, and accident search and rescue. This paper focuses on the problem of area coverage search for a single UAV in environments with the presence of unknown dynamic and static targets as well as hazardous areas. Here the UAV only knows the state of a small area and remembers the actions of the last time step without long-term memory. The objective is to design an adaptive and transferable algorithm for the UAV to find all static and dynamic targets with the minimum path repetition rate in the task area where both dangerous areas and completely unknown target information exist. Because the UAV has limited ability to observe all the map information, a partially observable Markov decision process is first formulated. Then we develop a coverage search algorithm based on Double Deep Q Network (DDQN) with the help of curriculum learning. By designing multiple constraint reward functions and employing path repetition rate and target batting average as evaluation metrics, the proposed algorithm facilitates the rapid adaptation of UAVs to diverse task environments. Simulation environment and algorithm models are finally established to illustrate the efficacy of the algorithm, which shows that the proposed algorithm with curriculum learning has rapid convergence, minimal path redundancy, high target acquisition rate, robust portability, and adaptability to variations in map area, hazard zones, and target quantity.
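The abstract names Double Deep Q Network (DDQN) but gives no implementation details. As a generic illustration of the standard Double DQN idea, and not the authors' code, the sketch below computes the bootstrap target for a single transition: the online network's Q-values choose the next action, and the target network's Q-values score it. The function name `double_dqn_target`, its parameters, and the example numbers are hypothetical.

```python
def double_dqn_target(reward, next_q_online, next_q_target, gamma=0.99, done=False):
    """Standard Double DQN target for one transition (illustrative sketch).

    next_q_online: Q-values of the next state from the online network.
    next_q_target: Q-values of the next state from the target network.
    Both are plain lists indexed by action; all names here are assumptions.
    """
    if done:
        # A terminal transition has no bootstrap term.
        return reward
    # Action selection uses the online network...
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    # ...while action evaluation uses the target network.
    return reward + gamma * next_q_target[best_action]


# Example with made-up Q-values for three actions.
target = double_dqn_target(1.0, [0.2, 0.9, 0.1], [0.5, 0.3, 0.8], gamma=0.5)
```

In this example the online network prefers action 1, so the target uses the target network's value for action 1 (0.3) rather than its own maximum (0.8). Decoupling selection from evaluation in this way is what distinguishes Double DQN from plain DQN.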

Original language	English
Title of host publication	Proceedings - 2023 China Automation Congress, CAC 2023
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	2826-2831
Number of pages	6
ISBN (Electronic)	9798350303759
DOI	https://doi.org/10.1109/CAC59555.2023.10451197
Publication status	Published - 2023
Event	2023 China Automation Congress, CAC 2023 - Chongqing, China. Duration: 17 Nov 2023 → 19 Nov 2023

Publication series

Name	Proceedings - 2023 China Automation Congress, CAC 2023

Conference

Conference	2023 China Automation Congress, CAC 2023
Country/Territory	China
City	Chongqing
Period	17/11/23 → 19/11/23

Access to document

10.1109/CAC59555.2023.10451197

Other files and links

Link to publication in Scopus

Cite this

@inproceedings{7c64c54bf4c34e879bab662b511456d0,

title = "Research on UAV Coverage Search Based on DDQN in Unknown Environments",

abstract = "The utilization of unmanned aerial vehicle (UAV) for area coverage search is highly sought after in both military and civil domains, including but not limited to traversal search, mission reconnaissance, patrol detection, wildfire suppression control, remote sensing mapping, agricultural preservation, and accident search and rescue. This paper focuses on the problem of area coverage search for a single UAV in environments with the presence of unknown dynamic and static targets as well as hazardous areas. Here the UAV only knows the state of a small area and remembers the actions of the last time step without long-term memory. The objective is to design an adaptive and transferable algorithm for the UAV to find all static and dynamic targets with the minimum path repetition rate in the task area where both dangerous areas and completely unknown target information exist. Because the UAV has limited ability to observe all the map information, a partially observable Markov decision process is first formulated. Then we develop a coverage search algorithm based on Double Deep Q Network (DDQN) with the help of curriculum learning. By designing multiple constraint reward functions and employing path repetition rate and target batting average as evaluation metrics, the proposed algorithm facilitates the rapid adaptation of UAVs to diverse task environments. Simulation environment and algorithm models are finally established to illustrate the efficacy of the algorithm, which shows that the proposed algorithm with curriculum learning has rapid convergence, minimal path redundancy, high target acquisition rate, robust portability, and adaptability to variations in map area, hazard zones, and target quantity.",

keywords = "Double Deep Q Network, coverage search, dynamic targets, partially observable Markov decision process, unknown environment",

author = "Gaofeng Deng and Xiaolan Yao and Bo Wang and Xiao He and Qing Fei",

note = "Publisher Copyright: {\textcopyright} 2023 IEEE.; 2023 China Automation Congress, CAC 2023 ; Conference date: 17-11-2023 Through 19-11-2023",

year = "2023",

doi = "10.1109/CAC59555.2023.10451197",

language = "English",

series = "Proceedings - 2023 China Automation Congress, CAC 2023",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "2826--2831",

booktitle = "Proceedings - 2023 China Automation Congress, CAC 2023",

address = "United States",

}

Deng, G, Yao, X, Wang, B, He, X & Fei, Q 2023, Research on UAV Coverage Search Based on DDQN in Unknown Environments. In Proceedings - 2023 China Automation Congress, CAC 2023. Proceedings - 2023 China Automation Congress, CAC 2023, Institute of Electrical and Electronics Engineers Inc., pp. 2826-2831, 2023 China Automation Congress, CAC 2023, Chongqing, China, 17/11/23. https://doi.org/10.1109/CAC59555.2023.10451197

Research on UAV Coverage Search Based on DDQN in Unknown Environments. / Deng, Gaofeng; Yao, Xiaolan; Wang, Bo et al.
Proceedings - 2023 China Automation Congress, CAC 2023. Institute of Electrical and Electronics Engineers Inc., 2023. pp. 2826-2831 (Proceedings - 2023 China Automation Congress, CAC 2023).

TY - GEN

T1 - Research on UAV Coverage Search Based on DDQN in Unknown Environments

AU - Deng, Gaofeng

AU - Yao, Xiaolan

AU - Wang, Bo

AU - He, Xiao

AU - Fei, Qing

PY - 2023

Y1 - 2023

N2 - The utilization of unmanned aerial vehicle (UAV) for area coverage search is highly sought after in both military and civil domains, including but not limited to traversal search, mission reconnaissance, patrol detection, wildfire suppression control, remote sensing mapping, agricultural preservation, and accident search and rescue. This paper focuses on the problem of area coverage search for a single UAV in environments with the presence of unknown dynamic and static targets as well as hazardous areas. Here the UAV only knows the state of a small area and remembers the actions of the last time step without long-term memory. The objective is to design an adaptive and transferable algorithm for the UAV to find all static and dynamic targets with the minimum path repetition rate in the task area where both dangerous areas and completely unknown target information exist. Because the UAV has limited ability to observe all the map information, a partially observable Markov decision process is first formulated. Then we develop a coverage search algorithm based on Double Deep Q Network (DDQN) with the help of curriculum learning. By designing multiple constraint reward functions and employing path repetition rate and target batting average as evaluation metrics, the proposed algorithm facilitates the rapid adaptation of UAVs to diverse task environments. Simulation environment and algorithm models are finally established to illustrate the efficacy of the algorithm, which shows that the proposed algorithm with curriculum learning has rapid convergence, minimal path redundancy, high target acquisition rate, robust portability, and adaptability to variations in map area, hazard zones, and target quantity.

AB - The utilization of unmanned aerial vehicle (UAV) for area coverage search is highly sought after in both military and civil domains, including but not limited to traversal search, mission reconnaissance, patrol detection, wildfire suppression control, remote sensing mapping, agricultural preservation, and accident search and rescue. This paper focuses on the problem of area coverage search for a single UAV in environments with the presence of unknown dynamic and static targets as well as hazardous areas. Here the UAV only knows the state of a small area and remembers the actions of the last time step without long-term memory. The objective is to design an adaptive and transferable algorithm for the UAV to find all static and dynamic targets with the minimum path repetition rate in the task area where both dangerous areas and completely unknown target information exist. Because the UAV has limited ability to observe all the map information, a partially observable Markov decision process is first formulated. Then we develop a coverage search algorithm based on Double Deep Q Network (DDQN) with the help of curriculum learning. By designing multiple constraint reward functions and employing path repetition rate and target batting average as evaluation metrics, the proposed algorithm facilitates the rapid adaptation of UAVs to diverse task environments. Simulation environment and algorithm models are finally established to illustrate the efficacy of the algorithm, which shows that the proposed algorithm with curriculum learning has rapid convergence, minimal path redundancy, high target acquisition rate, robust portability, and adaptability to variations in map area, hazard zones, and target quantity.

KW - Double Deep Q Network

KW - coverage search

KW - dynamic targets

KW - partially observable Markov decision process

KW - unknown environment

UR - http://www.scopus.com/inward/record.url?scp=85189311176&partnerID=8YFLogxK

U2 - 10.1109/CAC59555.2023.10451197

DO - 10.1109/CAC59555.2023.10451197

M3 - Conference contribution

AN - SCOPUS:85189311176

T3 - Proceedings - 2023 China Automation Congress, CAC 2023

SP - 2826

EP - 2831

BT - Proceedings - 2023 China Automation Congress, CAC 2023

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2023 China Automation Congress, CAC 2023

Y2 - 17 November 2023 through 19 November 2023

ER -
