GCVNet: Geometry Constrained Voting Network to Estimate 3D Pose for Fine-Grained Object Categories

Yaohang Han, Huijun Di*, Hanfeng Zheng, Jianyong Qi, Jianwei Gong

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Citations (Scopus)

Abstract

As a fundamental AI problem, monocular 3D pose estimation has received much attention. This paper addresses the challenge of estimating full perspective model parameters, including object pose and camera intrinsics, from a single 2D image of fine-grained object categories. To tackle this highly ill-posed problem, we propose a Geometry Constrained Voting Network (GCVNet). It is a unified end-to-end network consisting of four synergic task-specific subnetworks: 1) Fine-grained classification subnetwork, offering fine-grained 3D shape priors. 2) Voting subnetwork, generating 2D measurements. 3) Segmentation subnetwork, providing a foreground mask for voting. 4) PnP subnetwork, estimating the perspective parameters via explicit geometric reasoning, as well as constraining the classification subnetwork to provide proper 3D priors and the voting subnetwork to generate a group of geometric consistent 2D measurements, rather than independent voting for each 2D measurement in the literature. Experiments on challenging datasets demonstrate the superior performance of GCVNet.

Original languageEnglish
Title of host publicationPattern Recognition and Computer Vision - 3rd Chinese Conference, PRCV 2020, Proceedings
EditorsYuxin Peng, Hongbin Zha, Qingshan Liu, Huchuan Lu, Zhenan Sun, Chenglin Liu, Xilin Chen, Jian Yang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages180-192
Number of pages13
ISBN (Print)9783030606329
DOIs
Publication statusPublished - 2020
Event3rd Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2020 - Nanjing, China
Duration: 16 Oct 202018 Oct 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12305 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference3rd Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2020
Country/TerritoryChina
CityNanjing
Period16/10/2018/10/20

Keywords

  • Differentiable PnP
  • Geometric reasoning
  • Pose estimation

Fingerprint

Dive into the research topics of 'GCVNet: Geometry Constrained Voting Network to Estimate 3D Pose for Fine-Grained Object Categories'. Together they form a unique fingerprint.

Cite this