Skip to main navigation Skip to search Skip to main content

DFVNet: Density Feature Guided Small Object Detection in Video

  • Junhe Lv
  • , Linwei Chen
  • , Ying Fu*
  • , Jun Yin
  • , Yayun Wang
  • , Chenggang Yan
  • *Corresponding author for this work
  • Beijing Institute of Technology
  • Zhejiang Dahua Technology Co., Ltd.
  • Hangzhou Dianzi University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Detecting small objects is a critical task in various application domains, including autonomous driving and drone recognition. However, existing small object detection methods, which are predominantly based on single images, often fail to capture essential information due to issues such as density and motion blur. These methods also neglect the temporal context inherent in real-world scenarios, resulting in suboptimal detection performance. To address these limitations, this paper proposes a novel video-based Detection Transformer Network, named DFVNet, for small object detection. DFVNet leverages density features to extract information from densely packed s and enhances the current detection frame through a coarse-to-fine refinement process, utilizing reference features from adjacent temporal frames. Extensive experiments on the VisDrone2019-VID dataset show that DFVNet improves average precision AP50 by 3.0% compared to image-based detection methods and by 4.1% over other video-based detection methods.

Original languageEnglish
Title of host publicationICVISP 2025 Proceedings - 2025 9th International Conference on Vision, Image and Signal Processing
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331556822
DOIs
Publication statusPublished - 2025
Externally publishedYes
Event9th International Conference on Vision, Image and Signal Processing, ICVISP 2025 - Xi'an, China
Duration: 28 Nov 202530 Nov 2025

Publication series

NameICVISP 2025 Proceedings - 2025 9th International Conference on Vision, Image and Signal Processing

Conference

Conference9th International Conference on Vision, Image and Signal Processing, ICVISP 2025
Country/TerritoryChina
CityXi'an
Period28/11/2530/11/25

Keywords

  • Density Feature
  • Dynamic Query
  • Multi-Frame Information Fusion
  • Small Object Detection in Video

Fingerprint

Dive into the research topics of 'DFVNet: Density Feature Guided Small Object Detection in Video'. Together they form a unique fingerprint.

Cite this