TY - GEN
T1 - CADNet
T2 - 2024 International Conference on Electronic Engineering and Information Systems, EEISS 2024
AU - Zhu, Canjie
AU - Sun, Huifang
AU - Lu, Mingfeng
AU - Zhang, Feng
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Monocular depth estimation is a crucial technology for comprehending scenes, and acquiring global contextual information is pivotal for enhancing depth estimation accuracy. Traditional approaches for incorporating global context information involve pooling feature maps of varying receptive field sizes. Nevertheless, they fail to address challenges such as object boundary distortion and the loss of local detail information caused by complex textures and geometric structures in scenes. To tackle these issues, this paper proposes a novel monocular depth estimation model called CADNet (Context-aggregated DCPPM monocular depth estimation network). This model leverages a multi-scale context aggregation module, DCPPM, to effectively aggregate local features into a global framework, thereby resolving the problem of local detail loss during network training. Experimental results demonstrate that the CADNet model surpasses the NewCRFs model in complex scene boundary detection and capturing local object details. Furthermore, with a 6.27% reduction in parameter count, the CADNet model achieves a noteworthy 9.82% decrease in Sq Rel error on the KITTI dataset and exhibits remarkable performance in general depth estimation metrics for both indoor and outdoor scenes.
KW - Contextual information aggregation
KW - Depth boundaries
KW - Fully connected conditional random field
KW - Monocular depth estimation
KW - Scene comprehension
UR - http://www.scopus.com/inward/record.url?scp=85201207630&partnerID=8YFLogxK
DO - 10.1109/EEISS62553.2024.00035
M3 - Conference contribution
AN - SCOPUS:85201207630
T3 - Proceedings - 2024 International Conference on Electronic Engineering and Information Systems, EEISS 2024
SP - 161
EP - 165
BT - Proceedings - 2024 International Conference on Electronic Engineering and Information Systems, EEISS 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 13 January 2024 through 15 January 2024
ER -