TY - GEN
T1 - Target Recognition and 3D Pose Estimation Based on Prior Knowledge and Convolutional Neural Network for Robots
AU - Sun, Jingwen
AU - Zhao, Lijun
AU - Wang, Li
AU - Wang, Ke
AU - Ma, Yuting
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/11
Y1 - 2019/11
N2 - In the competition of RoboMaster, the robot needs to trigger the target, the 'Energy Mechanism' which consists of nine different dynamic flame numbers, in a nine square area by shooting projectiles. Therefore, 3D target detection should be implemented including target recognition and 3D pose estimation in real-time. As the targets are dynamic flame numbers and quite small in the whole image, it increases the difficulty to detect. The robot should achieve to shoot the target in multi-angle and multi-scale to adjust the competition. To address these issues, we propose a fast and accurate method to detect all nine numbers and estimate each 3D pose based on prior knowledge and convolutional neural network only by a monocular camera. The geometric constraints around the target are employed as prior knowledge when estimating the target pose. Then, we utilize the relative position information to detect the region of each dynamic number in the image, which is recognized by a convolutional neural network trained by flame numbers. Experiments in the actual environment show that our method can achieve the detection of each dynamic number in real-time and high accuracy. The runtime is 29ms on average (about 11ms in detection and 18ms in recognition) and the recognition accuracy is about 94.69%. And our method wins the first place in the technical challenge of 2018 RoboMaster competition.
AB - In the competition of RoboMaster, the robot needs to trigger the target, the 'Energy Mechanism' which consists of nine different dynamic flame numbers, in a nine square area by shooting projectiles. Therefore, 3D target detection should be implemented including target recognition and 3D pose estimation in real-time. As the targets are dynamic flame numbers and quite small in the whole image, it increases the difficulty to detect. The robot should achieve to shoot the target in multi-angle and multi-scale to adjust the competition. To address these issues, we propose a fast and accurate method to detect all nine numbers and estimate each 3D pose based on prior knowledge and convolutional neural network only by a monocular camera. The geometric constraints around the target are employed as prior knowledge when estimating the target pose. Then, we utilize the relative position information to detect the region of each dynamic number in the image, which is recognized by a convolutional neural network trained by flame numbers. Experiments in the actual environment show that our method can achieve the detection of each dynamic number in real-time and high accuracy. The runtime is 29ms on average (about 11ms in detection and 18ms in recognition) and the recognition accuracy is about 94.69%. And our method wins the first place in the technical challenge of 2018 RoboMaster competition.
KW - 3D pose estimation
KW - CNN
KW - target recognition
UR - http://www.scopus.com/inward/record.url?scp=85080094848&partnerID=8YFLogxK
U2 - 10.1109/CAC48633.2019.8996630
DO - 10.1109/CAC48633.2019.8996630
M3 - Conference contribution
AN - SCOPUS:85080094848
T3 - Proceedings - 2019 Chinese Automation Congress, CAC 2019
SP - 298
EP - 304
BT - Proceedings - 2019 Chinese Automation Congress, CAC 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2019 Chinese Automation Congress, CAC 2019
Y2 - 22 November 2019 through 24 November 2019
ER -