DSA: Deformable Segmentation Attention for Multi-Scale Fisheye Image Segmentation

Junzhe Jiang, Cheng Xu, Hongzhe Liu*, Ying Fu, Muwei Jian

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

With a larger field of view (FOV) than ordinary images, fisheye images are becoming mainstream in the field of autonomous driving. However, the severe distortion problem of fisheye images also limits its application. The performance of neural networks designed for narrow FOV images degrades drastically for fisheye images, and the use of large composite models can improve the performance, but it brings huge time overhead and hardware costs. Therefore, we decided to balance real time and accuracy by designing the deformable segmentation attention(DSA) module, a generalpurpose architecture based on a deformable attention mechanism and a spatial pyramid architecture. The deformable mechanism serves to accurately extract feature information from fisheye images, together with attention to learn the global context and the spatial pyramid structure to balance multiscale feature information, thus improving the perception of fisheye images by traditional networks without increasing the amount of excessive computation. Lightweight networks such as SegNeXt equipped with the DSA module enable effective and rapid multi-scale segmentation of fisheye images in complex scenes. Our architecture achieves outstanding results on the WoodScape dataset, while our ablation experiments demonstrate the effectiveness of various parts of the architecture.

Original languageEnglish
Article number4059
JournalElectronics (Switzerland)
Volume12
Issue number19
DOIs
Publication statusPublished - Oct 2023

Keywords

  • deformable attention
  • fisheye image segmentation
  • spatial pyramid

Fingerprint

Dive into the research topics of 'DSA: Deformable Segmentation Attention for Multi-Scale Fisheye Image Segmentation'. Together they form a unique fingerprint.

Cite this