Fast and Accurate Craniomaxillofacial Landmark Detection via 3D Faster R-CNN

Xiaoyang Chen; Chunfeng Lian; Hannah H. Deng; Tianshu Kuang; Hung Ying Lin; Deqiang Xiao; Jaime Gateno; Dinggang Shen; James J. Xia; Pew Thian Yap

doi:10.1109/TMI.2021.3099509

Xiaoyang Chen*, Chunfeng Lian, Hannah H. Deng, Tianshu Kuang, Hung Ying Lin, Deqiang Xiao, Jaime Gateno, Dinggang Shen, James J. Xia, Pew Thian Yap

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

34 Citations (Scopus)

Abstract

Automatic craniomaxillofacial (CMF) landmark localization from cone-beam computed tomography (CBCT) images is challenging, considering that 1) the number of landmarks in the images may change due to varying deformities and traumatic defects, and 2) the CBCT images used in clinical practice are typically large. In this paper, we propose a two-stage, coarse-to-fine deep learning method to tackle these challenges with both speed and accuracy in mind. Specifically, we first use a 3D faster R-CNN to roughly locate landmarks in down-sampled CBCT images that have varying numbers of landmarks. By converting the landmark point detection problem to a generic object detection problem, our 3D faster R-CNN is formulated to detect virtual, fixed-size objects in small boxes with centers indicating the approximate locations of the landmarks. Based on the rough landmark locations, we then crop 3D patches from the high-resolution images and send them to a multi-scale UNet for the regression of heatmaps, from which the refined landmark locations are finally derived. We evaluated the proposed approach by detecting up to 18 landmarks on a real clinical dataset of CMF CBCT images with various conditions. Experiments show that our approach achieves state-of-the-art accuracy of 0.89 ± 0.64 mm in an average time of 26.2 seconds per volume.
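The coordinate bookkeeping behind the two stages described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the networks themselves are omitted, and every function name, the box size, the down-sampling factor, the patch size, and the use of a simple arg-max to decode the heatmap are assumptions made for illustration. The abstract only states that box centers give approximate landmark locations and that refined locations are derived from regressed heatmaps.

```python
from typing import List, Tuple

Point = Tuple[float, float, float]
Box = Tuple[float, float, float, float, float, float]
Voxel = Tuple[int, int, int]
Heatmap = List[List[List[float]]]


def landmark_to_box(center: Point, size: float) -> Box:
    """Wrap a landmark point in a virtual cube of edge `size` (assumed value)."""
    half = size / 2.0
    z, y, x = center
    return (z - half, y - half, x - half, z + half, y + half, x + half)


def box_center(box: Box) -> Point:
    """Read the approximate landmark location back as the box center."""
    z0, y0, x0, z1, y1, x1 = box
    return ((z0 + z1) / 2.0, (y0 + y1) / 2.0, (x0 + x1) / 2.0)


def to_full_resolution(point: Point, factor: float) -> Point:
    """Scale a down-sampled coordinate to the full-resolution grid.

    Assumption: plain multiplication, ignoring any half-voxel offset.
    """
    z, y, x = point
    return (z * factor, y * factor, x * factor)


def patch_origin(center: Point, patch_size: int) -> Voxel:
    """Corner voxel of a cubic patch centered (approximately) on `center`."""
    half = patch_size // 2
    z, y, x = center
    return (int(round(z)) - half, int(round(y)) - half, int(round(x)) - half)


def heatmap_peak(heatmap: Heatmap) -> Voxel:
    """Index of the largest value in a nested-list 3D heatmap (arg-max decoding)."""
    best_value = float("-inf")
    best_index = (0, 0, 0)
    for z, plane in enumerate(heatmap):
        for y, row in enumerate(plane):
            for x, value in enumerate(row):
                if value > best_value:
                    best_value = value
                    best_index = (z, y, x)
    return best_index


def refine(coarse: Point, factor: float, patch_size: int, heatmap: Heatmap) -> Voxel:
    """Combine the coarse estimate with the heatmap peak inside the cropped patch."""
    center = to_full_resolution(coarse, factor)
    oz, oy, ox = patch_origin(center, patch_size)
    pz, py, px = heatmap_peak(heatmap)
    return (oz + pz, oy + py, ox + px)
```

In this sketch the heatmap is assumed to have the same shape as the cropped patch, and no bounds checking is done for patches that extend past the image border; a real pipeline would need to handle both.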

Original language	English
Pages (from-to)	3867-3878
Number of pages	12
Journal	IEEE Transactions on Medical Imaging
Volume	40
Issue number	12
DOIs	https://doi.org/10.1109/TMI.2021.3099509
Publication status	Published - 1 Dec 2021
Externally published	Yes

Keywords

3D faster R-CNN
CBCT
CMF
Landmark detection
heatmap regression

Cite this

@article{e2d6e1f391fa4f41b3a081897d0244fd,

title = "Fast and Accurate Craniomaxillofacial Landmark Detection via 3D Faster R-CNN",

abstract = "Automatic craniomaxillofacial (CMF) landmark localization from cone-beam computed tomography (CBCT) images is challenging, considering that 1) the number of landmarks in the images may change due to varying deformities and traumatic defects, and 2) the CBCT images used in clinical practice are typically large. In this paper, we propose a two-stage, coarse-to-fine deep learning method to tackle these challenges with both speed and accuracy in mind. Specifically, we first use a 3D faster R-CNN to roughly locate landmarks in down-sampled CBCT images that have varying numbers of landmarks. By converting the landmark point detection problem to a generic object detection problem, our 3D faster R-CNN is formulated to detect virtual, fixed-size objects in small boxes with centers indicating the approximate locations of the landmarks. Based on the rough landmark locations, we then crop 3D patches from the high-resolution images and send them to a multi-scale UNet for the regression of heatmaps, from which the refined landmark locations are finally derived. We evaluated the proposed approach by detecting up to 18 landmarks on a real clinical dataset of CMF CBCT images with various conditions. Experiments show that our approach achieves state-of-the-art accuracy of 0.89 ± 0.64mm in an average time of 26.2 seconds per volume.",

keywords = "3D faster R-CNN, CBCT, CMF, Landmark detection, heatmap regression",

author = "Xiaoyang Chen and Chunfeng Lian and Deng, {Hannah H.} and Tianshu Kuang and Lin, {Hung Ying} and Deqiang Xiao and Jaime Gateno and Dinggang Shen and Xia, {James J.} and Yap, {Pew Thian}",

note = "Publisher Copyright: {\textcopyright} 1982-2012 IEEE.",

year = "2021",

month = dec,

day = "1",

doi = "10.1109/TMI.2021.3099509",

language = "English",

volume = "40",

pages = "3867--3878",

journal = "IEEE Transactions on Medical Imaging",

issn = "0278-0062",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "12",

}

TY - JOUR

T1 - Fast and Accurate Craniomaxillofacial Landmark Detection via 3D Faster R-CNN

AU - Chen, Xiaoyang

AU - Lian, Chunfeng

AU - Deng, Hannah H.

AU - Kuang, Tianshu

AU - Lin, Hung Ying

AU - Xiao, Deqiang

AU - Gateno, Jaime

AU - Shen, Dinggang

AU - Xia, James J.

AU - Yap, Pew Thian

PY - 2021/12/1

Y1 - 2021/12/1

N2 - Automatic craniomaxillofacial (CMF) landmark localization from cone-beam computed tomography (CBCT) images is challenging, considering that 1) the number of landmarks in the images may change due to varying deformities and traumatic defects, and 2) the CBCT images used in clinical practice are typically large. In this paper, we propose a two-stage, coarse-to-fine deep learning method to tackle these challenges with both speed and accuracy in mind. Specifically, we first use a 3D faster R-CNN to roughly locate landmarks in down-sampled CBCT images that have varying numbers of landmarks. By converting the landmark point detection problem to a generic object detection problem, our 3D faster R-CNN is formulated to detect virtual, fixed-size objects in small boxes with centers indicating the approximate locations of the landmarks. Based on the rough landmark locations, we then crop 3D patches from the high-resolution images and send them to a multi-scale UNet for the regression of heatmaps, from which the refined landmark locations are finally derived. We evaluated the proposed approach by detecting up to 18 landmarks on a real clinical dataset of CMF CBCT images with various conditions. Experiments show that our approach achieves state-of-the-art accuracy of 0.89 ± 0.64mm in an average time of 26.2 seconds per volume.

AB - Automatic craniomaxillofacial (CMF) landmark localization from cone-beam computed tomography (CBCT) images is challenging, considering that 1) the number of landmarks in the images may change due to varying deformities and traumatic defects, and 2) the CBCT images used in clinical practice are typically large. In this paper, we propose a two-stage, coarse-to-fine deep learning method to tackle these challenges with both speed and accuracy in mind. Specifically, we first use a 3D faster R-CNN to roughly locate landmarks in down-sampled CBCT images that have varying numbers of landmarks. By converting the landmark point detection problem to a generic object detection problem, our 3D faster R-CNN is formulated to detect virtual, fixed-size objects in small boxes with centers indicating the approximate locations of the landmarks. Based on the rough landmark locations, we then crop 3D patches from the high-resolution images and send them to a multi-scale UNet for the regression of heatmaps, from which the refined landmark locations are finally derived. We evaluated the proposed approach by detecting up to 18 landmarks on a real clinical dataset of CMF CBCT images with various conditions. Experiments show that our approach achieves state-of-the-art accuracy of 0.89 ± 0.64mm in an average time of 26.2 seconds per volume.

KW - 3D faster R-CNN

KW - CBCT

KW - CMF

KW - Landmark detection

KW - heatmap regression

UR - http://www.scopus.com/inward/record.url?scp=85112643818&partnerID=8YFLogxK

U2 - 10.1109/TMI.2021.3099509

DO - 10.1109/TMI.2021.3099509

M3 - Article

C2 - 34310293

AN - SCOPUS:85112643818

SN - 0278-0062

VL - 40

SP - 3867

EP - 3878

JO - IEEE Transactions on Medical Imaging

JF - IEEE Transactions on Medical Imaging

IS - 12

ER -
