TY - GEN
T1 - Towards Broad Learning Networks on Unmanned Mobile Robot for Semantic Segmentation
AU - Li, Jiehao
AU - Dai, Yingpeng
AU - Wang, Junzheng
AU - Su, Xiaohang
AU - Ma, Ruijun
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - This article investigates real-time semantic segmentation for robot engineering applications based on the Broad Learning System (BLS), and a novel Multi-level Enhancement Layers Network (MELNet) built on the BLS framework is proposed for real-time vision tasks in complex street scenes on an unmanned mobile robot. The network addresses two problems: (1) mitigating the trade-off between accuracy and speed while maintaining low model complexity, and (2) accurately describing objects of different sizes based on their shape. First, the BLS architecture is expanded into a deep network with trainable parameters; this network can adjust its weights in complex environments and mitigate their adverse impact on complex tasks. Second, enhancement layers combined with extended enhancement layers extract both detailed and semantic information. Moreover, an Upsampling Atrous Spatial Pyramid Pooling (UPASPP) module is designed to fuse detailed and semantic information so as to describe object features properly. Finally, on the MNIST and Cityscapes datasets, the network achieves high accuracy with 8.01M parameters and fast inference on a single GTX 1070 Ti card. In addition, the unmanned mobile robot (BIT-NAZA) is employed to evaluate segmentation performance in real-world situations, demonstrating that MELNet runs adequately on the embedded device and operates effectively in a real robot system.
AB - This article investigates real-time semantic segmentation for robot engineering applications based on the Broad Learning System (BLS), and a novel Multi-level Enhancement Layers Network (MELNet) built on the BLS framework is proposed for real-time vision tasks in complex street scenes on an unmanned mobile robot. The network addresses two problems: (1) mitigating the trade-off between accuracy and speed while maintaining low model complexity, and (2) accurately describing objects of different sizes based on their shape. First, the BLS architecture is expanded into a deep network with trainable parameters; this network can adjust its weights in complex environments and mitigate their adverse impact on complex tasks. Second, enhancement layers combined with extended enhancement layers extract both detailed and semantic information. Moreover, an Upsampling Atrous Spatial Pyramid Pooling (UPASPP) module is designed to fuse detailed and semantic information so as to describe object features properly. Finally, on the MNIST and Cityscapes datasets, the network achieves high accuracy with 8.01M parameters and fast inference on a single GTX 1070 Ti card. In addition, the unmanned mobile robot (BIT-NAZA) is employed to evaluate segmentation performance in real-world situations, demonstrating that MELNet runs adequately on the embedded device and operates effectively in a real robot system.
UR - http://www.scopus.com/inward/record.url?scp=85135827630&partnerID=8YFLogxK
U2 - 10.1109/ICRA46639.2022.9812204
DO - 10.1109/ICRA46639.2022.9812204
M3 - Conference contribution
AN - SCOPUS:85135827630
T3 - Proceedings - IEEE International Conference on Robotics and Automation
SP - 9228
EP - 9234
BT - 2022 IEEE International Conference on Robotics and Automation, ICRA 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 39th IEEE International Conference on Robotics and Automation, ICRA 2022
Y2 - 23 May 2022 through 27 May 2022
ER -