Semantic Object-Level Modeling for Robust Visual Camera Relocalization

Yifan Zhu, Lingjuan Miao, Haitao Wu*, Zhiqiang Zhou, Weiyi Chen, Longwen Wu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Visual relocalization is crucial for autonomous visual localization and navigation of mobile robotics. Due to the improvement of CNN-based object detection algorithm, the robustness of visual relocalization is greatly enhanced especially in viewpoints where classical methods fail. However, ellipsoids (quadrics) generated by axis-aligned object detection may limit the accuracy of the object-level representation and degenerate the performance of visual relocalization system. In this paper, we propose a novel method of automatic object-level voxel modeling for accurate ellipsoidal representations of objects. As for visual relocalization, we design a better pose optimization strategy for camera pose recovery, to fully utilize the projection characteristics of 2D fitted ellipses and the 3D accurate ellipsoids. All of these modules are entirely intergrated into visual SLAM system. Experimental results show that our semantic object-level mapping and object-based visual relocalization methods significantly enhance the performance of visual relocalization in terms of robustness to new viewpoints.

Original languageEnglish
Title of host publicationProceedings of the 36th Chinese Control and Decision Conference, CCDC 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages5494-5500
Number of pages7
ISBN (Electronic)9798350387780
DOIs
Publication statusPublished - 2024
Event36th Chinese Control and Decision Conference, CCDC 2024 - Xi'an, China
Duration: 25 May 202427 May 2024

Publication series

NameProceedings of the 36th Chinese Control and Decision Conference, CCDC 2024

Conference

Conference36th Chinese Control and Decision Conference, CCDC 2024
Country/TerritoryChina
CityXi'an
Period25/05/2427/05/24

Keywords

  • ellipsoidal model
  • instance segmentation
  • object-level mapping
  • SLAM
  • visual relocalization

Fingerprint

Dive into the research topics of 'Semantic Object-Level Modeling for Robust Visual Camera Relocalization'. Together they form a unique fingerprint.

Cite this