ObjectFusion: Accurate object-level SLAM with neural object priors

Zi Xin Zou; Shi Sheng Huang; Tai Jiang Mu; Yu Ping Wang

doi:10.1016/j.gmod.2022.101165

Zi Xin Zou, Shi Sheng Huang, Tai Jiang Mu^*, Yu Ping Wang

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

10 Citations (Scopus)

Abstract

Previous object-level Simultaneous Localization and Mapping (SLAM) approaches still fail to create high-quality object-oriented 3D maps efficiently. The main challenges are how to represent object shape effectively and how to apply such an object representation efficiently to accurate online camera tracking. In this paper, we present ObjectFusion, a novel object-level SLAM system for static scenes that efficiently creates object-oriented 3D maps with high-quality object reconstruction by leveraging neural object priors. We propose a neural object representation that uses only a single encoder–decoder network to express object shape effectively across various categories, which benefits high-quality reconstruction of object instances. More importantly, we propose to convert this neural object representation into precise measurements in order to jointly optimize object shape, object pose, and camera pose for the final accurate 3D object reconstruction. Through extensive evaluations on synthetic and real-world RGB-D datasets, we show that ObjectFusion outperforms previous approaches, with better object reconstruction quality, a much smaller memory footprint, and greater efficiency, especially at the object level.

Original language	English
Article number	101165
Journal	Graphical Models
Volume	123
DOIs	https://doi.org/10.1016/j.gmod.2022.101165
Publication status	Published - Sept 2022
Externally published	Yes

Keywords

Deep 3D Representation and Reconstruction
Object-Level SLAM
Online Reconstruction

Cite this

Zou, Z. X., Huang, S. S., Mu, T. J., & Wang, Y. P. (2022). ObjectFusion: Accurate object-level SLAM with neural object priors. Graphical Models, 123, Article 101165. https://doi.org/10.1016/j.gmod.2022.101165

@article{80e32259d1d94c0d961c3c5e73f906fa,

title = "ObjectFusion: Accurate object-level SLAM with neural object priors",

abstract = "Previous object-level Simultaneous Localization and Mapping (SLAM) approaches still fail to create high quality object-oriented 3D map in an efficient way. The main challenges come from how to represent the object shape effectively and how to apply such object representation to accurate online camera tracking efficiently. In this paper, we provide ObjectFusion as a novel object-level SLAM in static scenes which efficiently creates object-oriented 3D map with high-quality object reconstruction, by leveraging neural object priors. We propose a neural object representation with only a single encoder–decoder network to effectively express the object shape across various categories, which benefits high quality reconstruction of object instance. More importantly, we propose to convert such neural object representation as precise measurements to jointly optimize the object shape, object pose and camera pose for the final accurate 3D object reconstruction. With extensive evaluations on synthetic and real-world RGB-D datasets, we show that our ObjectFusion outperforms previous approaches, with better object reconstruction quality, using much less memory footprint, and in a more efficient way, especially at the object level.",

keywords = "Deep 3D Representation and Reconstruction, Object-Level SLAM, Online Reconstruction",

author = "Zou, {Zi Xin} and Huang, {Shi Sheng} and Mu, {Tai Jiang} and Wang, {Yu Ping}",

note = "Publisher Copyright: {\textcopyright} 2022 Elsevier Inc.",

year = "2022",

month = sep,

doi = "10.1016/j.gmod.2022.101165",

language = "English",

volume = "123",

journal = "Graphical Models",

issn = "1524-0703",

publisher = "Elsevier Inc.",

}

TY - JOUR

T1 - ObjectFusion

T2 - Accurate object-level SLAM with neural object priors

AU - Zou, Zi Xin

AU - Huang, Shi Sheng

AU - Mu, Tai Jiang

AU - Wang, Yu Ping

PY - 2022/9

Y1 - 2022/9

N2 - Previous object-level Simultaneous Localization and Mapping (SLAM) approaches still fail to create high quality object-oriented 3D map in an efficient way. The main challenges come from how to represent the object shape effectively and how to apply such object representation to accurate online camera tracking efficiently. In this paper, we provide ObjectFusion as a novel object-level SLAM in static scenes which efficiently creates object-oriented 3D map with high-quality object reconstruction, by leveraging neural object priors. We propose a neural object representation with only a single encoder–decoder network to effectively express the object shape across various categories, which benefits high quality reconstruction of object instance. More importantly, we propose to convert such neural object representation as precise measurements to jointly optimize the object shape, object pose and camera pose for the final accurate 3D object reconstruction. With extensive evaluations on synthetic and real-world RGB-D datasets, we show that our ObjectFusion outperforms previous approaches, with better object reconstruction quality, using much less memory footprint, and in a more efficient way, especially at the object level.

AB - Previous object-level Simultaneous Localization and Mapping (SLAM) approaches still fail to create high quality object-oriented 3D map in an efficient way. The main challenges come from how to represent the object shape effectively and how to apply such object representation to accurate online camera tracking efficiently. In this paper, we provide ObjectFusion as a novel object-level SLAM in static scenes which efficiently creates object-oriented 3D map with high-quality object reconstruction, by leveraging neural object priors. We propose a neural object representation with only a single encoder–decoder network to effectively express the object shape across various categories, which benefits high quality reconstruction of object instance. More importantly, we propose to convert such neural object representation as precise measurements to jointly optimize the object shape, object pose and camera pose for the final accurate 3D object reconstruction. With extensive evaluations on synthetic and real-world RGB-D datasets, we show that our ObjectFusion outperforms previous approaches, with better object reconstruction quality, using much less memory footprint, and in a more efficient way, especially at the object level.

KW - Deep 3D Representation and Reconstruction

KW - Object-Level SLAM

KW - Online Reconstruction

UR - http://www.scopus.com/inward/record.url?scp=85135828135&partnerID=8YFLogxK

U2 - 10.1016/j.gmod.2022.101165

DO - 10.1016/j.gmod.2022.101165

M3 - Article

AN - SCOPUS:85135828135

SN - 1524-0703

VL - 123

JO - Graphical Models

JF - Graphical Models

M1 - 101165

ER -
