Fast and Accurate Craniomaxillofacial Landmark Detection via 3D Faster R-CNN

Xiaoyang Chen; Chunfeng Lian; Hannah H. Deng; Tianshu Kuang; Hung Ying Lin; Deqiang Xiao; Jaime Gateno; Dinggang Shen; James J. Xia; Pew Thian Yap

doi:10.1109/TMI.2021.3099509

Fast and Accurate Craniomaxillofacial Landmark Detection via 3D Faster R-CNN

Xiaoyang Chen^*, Chunfeng Lian, Hannah H. Deng, Tianshu Kuang, Hung Ying Lin, Deqiang Xiao, Jaime Gateno, Dinggang Shen, James J. Xia, Pew Thian Yap

^*此作品的通讯作者

科研成果: 期刊稿件 › 文章 › 同行评审

42 引用（Scopus）

摘要

Automatic craniomaxillofacial (CMF) landmark localization from cone-beam computed tomography (CBCT) images is challenging, considering that 1) the number of landmarks in the images may change due to varying deformities and traumatic defects, and 2) the CBCT images used in clinical practice are typically large. In this paper, we propose a two-stage, coarse-to-fine deep learning method to tackle these challenges with both speed and accuracy in mind. Specifically, we first use a 3D faster R-CNN to roughly locate landmarks in down-sampled CBCT images that have varying numbers of landmarks. By converting the landmark point detection problem to a generic object detection problem, our 3D faster R-CNN is formulated to detect virtual, fixed-size objects in small boxes with centers indicating the approximate locations of the landmarks. Based on the rough landmark locations, we then crop 3D patches from the high-resolution images and send them to a multi-scale UNet for the regression of heatmaps, from which the refined landmark locations are finally derived. We evaluated the proposed approach by detecting up to 18 landmarks on a real clinical dataset of CMF CBCT images with various conditions. Experiments show that our approach achieves state-of-the-art accuracy of 0.89 ± 0.64mm in an average time of 26.2 seconds per volume.

源语言	英语
页（从-至）	3867-3878
页数	12
期刊	IEEE Transactions on Medical Imaging
卷	40
期	12
DOI	https://doi.org/10.1109/TMI.2021.3099509
出版状态	已出版 - 1 12月 2021
已对外发布	是

访问文件

10.1109/TMI.2021.3099509

其它文件与链接

链接到 Scopus 的出版物

引用此

Chen, X., Lian, C., Deng, H. H., Kuang, T., Lin, H. Y., Xiao, D., Gateno, J., Shen, D., Xia, J. J., & Yap, P. T. (2021). Fast and Accurate Craniomaxillofacial Landmark Detection via 3D Faster R-CNN. IEEE Transactions on Medical Imaging, 40(12), 3867-3878. https://doi.org/10.1109/TMI.2021.3099509

@article{e2d6e1f391fa4f41b3a081897d0244fd,

title = "Fast and Accurate Craniomaxillofacial Landmark Detection via 3D Faster R-CNN",

abstract = "Automatic craniomaxillofacial (CMF) landmark localization from cone-beam computed tomography (CBCT) images is challenging, considering that 1) the number of landmarks in the images may change due to varying deformities and traumatic defects, and 2) the CBCT images used in clinical practice are typically large. In this paper, we propose a two-stage, coarse-to-fine deep learning method to tackle these challenges with both speed and accuracy in mind. Specifically, we first use a 3D faster R-CNN to roughly locate landmarks in down-sampled CBCT images that have varying numbers of landmarks. By converting the landmark point detection problem to a generic object detection problem, our 3D faster R-CNN is formulated to detect virtual, fixed-size objects in small boxes with centers indicating the approximate locations of the landmarks. Based on the rough landmark locations, we then crop 3D patches from the high-resolution images and send them to a multi-scale UNet for the regression of heatmaps, from which the refined landmark locations are finally derived. We evaluated the proposed approach by detecting up to 18 landmarks on a real clinical dataset of CMF CBCT images with various conditions. Experiments show that our approach achieves state-of-the-art accuracy of 0.89 ± 0.64mm in an average time of 26.2 seconds per volume.",

keywords = "3D faster R-CNN, CBCT, CMF, Landmark detection, heatmap regression",

author = "Xiaoyang Chen and Chunfeng Lian and Deng, {Hannah H.} and Tianshu Kuang and Lin, {Hung Ying} and Deqiang Xiao and Jaime Gateno and Dinggang Shen and Xia, {James J.} and Yap, {Pew Thian}",

note = "Publisher Copyright: {\textcopyright} 1982-2012 IEEE.",

year = "2021",

month = dec,

day = "1",

doi = "10.1109/TMI.2021.3099509",

language = "English",

volume = "40",

pages = "3867--3878",

journal = "IEEE Transactions on Medical Imaging",

issn = "0278-0062",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "12",

}

TY - JOUR

T1 - Fast and Accurate Craniomaxillofacial Landmark Detection via 3D Faster R-CNN

AU - Chen, Xiaoyang

AU - Lian, Chunfeng

AU - Deng, Hannah H.

AU - Kuang, Tianshu

AU - Lin, Hung Ying

AU - Xiao, Deqiang

AU - Gateno, Jaime

AU - Shen, Dinggang

AU - Xia, James J.

AU - Yap, Pew Thian

PY - 2021/12/1

Y1 - 2021/12/1

N2 - Automatic craniomaxillofacial (CMF) landmark localization from cone-beam computed tomography (CBCT) images is challenging, considering that 1) the number of landmarks in the images may change due to varying deformities and traumatic defects, and 2) the CBCT images used in clinical practice are typically large. In this paper, we propose a two-stage, coarse-to-fine deep learning method to tackle these challenges with both speed and accuracy in mind. Specifically, we first use a 3D faster R-CNN to roughly locate landmarks in down-sampled CBCT images that have varying numbers of landmarks. By converting the landmark point detection problem to a generic object detection problem, our 3D faster R-CNN is formulated to detect virtual, fixed-size objects in small boxes with centers indicating the approximate locations of the landmarks. Based on the rough landmark locations, we then crop 3D patches from the high-resolution images and send them to a multi-scale UNet for the regression of heatmaps, from which the refined landmark locations are finally derived. We evaluated the proposed approach by detecting up to 18 landmarks on a real clinical dataset of CMF CBCT images with various conditions. Experiments show that our approach achieves state-of-the-art accuracy of 0.89 ± 0.64mm in an average time of 26.2 seconds per volume.

AB - Automatic craniomaxillofacial (CMF) landmark localization from cone-beam computed tomography (CBCT) images is challenging, considering that 1) the number of landmarks in the images may change due to varying deformities and traumatic defects, and 2) the CBCT images used in clinical practice are typically large. In this paper, we propose a two-stage, coarse-to-fine deep learning method to tackle these challenges with both speed and accuracy in mind. Specifically, we first use a 3D faster R-CNN to roughly locate landmarks in down-sampled CBCT images that have varying numbers of landmarks. By converting the landmark point detection problem to a generic object detection problem, our 3D faster R-CNN is formulated to detect virtual, fixed-size objects in small boxes with centers indicating the approximate locations of the landmarks. Based on the rough landmark locations, we then crop 3D patches from the high-resolution images and send them to a multi-scale UNet for the regression of heatmaps, from which the refined landmark locations are finally derived. We evaluated the proposed approach by detecting up to 18 landmarks on a real clinical dataset of CMF CBCT images with various conditions. Experiments show that our approach achieves state-of-the-art accuracy of 0.89 ± 0.64mm in an average time of 26.2 seconds per volume.

KW - 3D faster R-CNN

KW - CBCT

KW - CMF

KW - Landmark detection

KW - heatmap regression

UR - http://www.scopus.com/inward/record.url?scp=85112643818&partnerID=8YFLogxK

U2 - 10.1109/TMI.2021.3099509

DO - 10.1109/TMI.2021.3099509

M3 - Article

C2 - 34310293

AN - SCOPUS:85112643818

SN - 0278-0062

VL - 40

SP - 3867

EP - 3878

JO - IEEE Transactions on Medical Imaging

JF - IEEE Transactions on Medical Imaging

IS - 12

ER -

Fast and Accurate Craniomaxillofacial Landmark Detection via 3D Faster R-CNN

摘要

访问文件

其它文件与链接

指纹

引用此