Monocular Depth Estimation with Enhanced Target Awareness for UAV Environment Perception

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Depth perception has draw a lot of attention recently as it enables 3D sensing capabilities for robots. Monocular depth estimation offers cost-effective and computationally efficient solutions, making it an ideal choice for deployment on resource-constrained platforms such as unmanned ariel vehicles (UAVs). However, existing work primarily focuses on indoor reconstruction and ground-based autonomous driving - methods that are not directly applicable to aerial scenarios. Moreover, during UAV flight, particular emphasis is placed on specific entities like targets and obstacles, whereas current methods typically perform global depth estimation and thus fail to satisfy this requirement. To tackle these challenges, this work presents a novel approach that jointly learns outdoor depth and target-region information. Target prior knowledge is injected into multi-scale encoder features through a novel mask-guided feature fusion strategy, sharpening object boundaries without adding extra constraints. A target-prioritized loss amplifies the model's attention on regions of interest. Extensive experiments on the DDOS dataset demonstrate that our method achieves state-of-the-art performance.

Original languageEnglish
Title of host publicationProceedings of the 44th Chinese Control Conference, CCC 2025
EditorsJian Sun, Hongpeng Yin
PublisherIEEE Computer Society
Pages8314-8320
Number of pages7
ISBN (Electronic)9789887581611
DOIs
Publication statusPublished - 2025
Externally publishedYes
Event44th Chinese Control Conference, CCC 2025 - Chongqing, China
Duration: 28 Jul 202530 Jul 2025

Publication series

NameChinese Control Conference, CCC
ISSN (Print)1934-1768
ISSN (Electronic)2161-2927

Conference

Conference44th Chinese Control Conference, CCC 2025
Country/TerritoryChina
CityChongqing
Period28/07/2530/07/25

Keywords

  • Mask-guided Feature Fusion
  • Monocular Depth Estimation
  • Target Regions
  • Unmanned Aerial Vehicles (UAVs)

Fingerprint

Dive into the research topics of 'Monocular Depth Estimation with Enhanced Target Awareness for UAV Environment Perception'. Together they form a unique fingerprint.

Cite this