DSA: Deformable Segmentation Attention for Multi-Scale Fisheye Image Segmentation

Junzhe Jiang; Cheng Xu; Hongzhe Liu; Ying Fu; Muwei Jian

doi:10.3390/electronics12194059

Junzhe Jiang, Cheng Xu, Hongzhe Liu^*, Ying Fu, Muwei Jian

^*Corresponding author for this work

School of Computer Science and Technology

Research output: Contribution to journal › Article › peer-review

Abstract

With a larger field of view (FOV) than ordinary images, fisheye images are becoming mainstream in the field of autonomous driving. However, the severe distortion of fisheye images also limits their application. The performance of neural networks designed for narrow-FOV images degrades drastically on fisheye images; large composite models can improve performance, but they bring huge time overhead and hardware costs. Therefore, we balance real-time performance and accuracy by designing the deformable segmentation attention (DSA) module, a general-purpose architecture based on a deformable attention mechanism and a spatial pyramid architecture. The deformable mechanism accurately extracts feature information from fisheye images, attention learns the global context, and the spatial pyramid structure balances multi-scale feature information, thus improving the perception of fisheye images by traditional networks without an excessive increase in computation. Lightweight networks such as SegNeXt equipped with the DSA module enable effective and rapid multi-scale segmentation of fisheye images in complex scenes. Our architecture achieves outstanding results on the WoodScape dataset, while our ablation experiments demonstrate the effectiveness of various parts of the architecture.
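The abstract names two generic building blocks: sampling features at deformed (offset) locations, and pooling features at several scales. The sketch below illustrates only those two ideas in plain Python. It is not the authors' DSA module: the function names, the fixed offsets, and the pooling scales are all hypothetical, and the attention step and any learned offset prediction are omitted.

```python
from typing import List, Sequence, Tuple

Grid = List[List[float]]


def bilinear_sample(feat: Grid, y: float, x: float) -> float:
    """Sample feat at a fractional (y, x) location, clamping to the border."""
    h, w = len(feat), len(feat[0])
    y = min(max(y, 0.0), h - 1.0)
    x = min(max(x, 0.0), w - 1.0)
    y0, x0 = int(y), int(x)
    y1, x1 = min(y0 + 1, h - 1), min(x0 + 1, w - 1)
    dy, dx = y - y0, x - x0
    top = feat[y0][x0] * (1 - dx) + feat[y0][x1] * dx
    bottom = feat[y1][x0] * (1 - dx) + feat[y1][x1] * dx
    return top * (1 - dy) + bottom * dy


def deformable_sample(feat: Grid, offsets: List[List[Tuple[float, float]]]) -> Grid:
    """Resample position (i, j) at (i + dy, j + dx), where offsets[i][j] = (dy, dx).

    In a real network the offsets would be predicted from the features;
    here they are supplied by the caller (an assumption for illustration).
    """
    h, w = len(feat), len(feat[0])
    return [
        [bilinear_sample(feat, i + offsets[i][j][0], j + offsets[i][j][1]) for j in range(w)]
        for i in range(h)
    ]


def avg_pool(feat: Grid, k: int) -> Grid:
    """Non-overlapping k x k average pooling; trailing rows/columns that do not fill a window are dropped."""
    h, w = len(feat), len(feat[0])
    return [
        [
            sum(feat[i + di][j + dj] for di in range(k) for dj in range(k)) / (k * k)
            for j in range(0, w - k + 1, k)
        ]
        for i in range(0, h - k + 1, k)
    ]


def pyramid(feat: Grid, scales: Sequence[int] = (1, 2, 4)) -> List[Grid]:
    """Return one pooled copy of feat per scale (hypothetical scales)."""
    return [avg_pool(feat, k) for k in scales]


# Usage: shift every sampling point half a pixel to the right, then pool.
feat = [[float(4 * i + j) for j in range(4)] for i in range(4)]
offsets = [[(0.0, 0.5)] * 4 for _ in range(4)]
levels = pyramid(deformable_sample(feat, offsets))
```

Supplying the offsets by hand keeps the sketch self-contained; it shows how offset sampling lets a fixed grid read from distorted positions, which is the property the abstract relies on for fisheye images.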

Original language	English
Article number	4059
Journal	Electronics (Switzerland)
Volume	12
Issue number	19
DOIs	https://doi.org/10.3390/electronics12194059
Publication status	Published - Oct 2023

Keywords

deformable attention
fisheye image segmentation
spatial pyramid

Access to Document

10.3390/electronics12194059

Cite this

@article{6f2dc08268e9471597effb9351c38c89,

title = "DSA: Deformable Segmentation Attention for Multi-Scale Fisheye Image Segmentation",

abstract = "With a larger field of view (FOV) than ordinary images, fisheye images are becoming mainstream in the field of autonomous driving. However, the severe distortion of fisheye images also limits their application. The performance of neural networks designed for narrow-FOV images degrades drastically on fisheye images; large composite models can improve performance, but they bring huge time overhead and hardware costs. Therefore, we balance real-time performance and accuracy by designing the deformable segmentation attention (DSA) module, a general-purpose architecture based on a deformable attention mechanism and a spatial pyramid architecture. The deformable mechanism accurately extracts feature information from fisheye images, attention learns the global context, and the spatial pyramid structure balances multi-scale feature information, thus improving the perception of fisheye images by traditional networks without an excessive increase in computation. Lightweight networks such as SegNeXt equipped with the DSA module enable effective and rapid multi-scale segmentation of fisheye images in complex scenes. Our architecture achieves outstanding results on the WoodScape dataset, while our ablation experiments demonstrate the effectiveness of various parts of the architecture.",

keywords = "deformable attention, fisheye image segmentation, spatial pyramid",

author = "Junzhe Jiang and Cheng Xu and Hongzhe Liu and Ying Fu and Muwei Jian",

note = "Publisher Copyright: {\textcopyright} 2023 by the authors.",

year = "2023",

month = oct,

doi = "10.3390/electronics12194059",

language = "English",

volume = "12",

journal = "Electronics (Switzerland)",

issn = "2079-9292",

publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",

number = "19",

}

TY - JOUR

T1 - DSA

T2 - Deformable Segmentation Attention for Multi-Scale Fisheye Image Segmentation

AU - Jiang, Junzhe

AU - Xu, Cheng

AU - Liu, Hongzhe

AU - Fu, Ying

AU - Jian, Muwei

PY - 2023/10

Y1 - 2023/10

N2 - With a larger field of view (FOV) than ordinary images, fisheye images are becoming mainstream in the field of autonomous driving. However, the severe distortion of fisheye images also limits their application. The performance of neural networks designed for narrow-FOV images degrades drastically on fisheye images; large composite models can improve performance, but they bring huge time overhead and hardware costs. Therefore, we balance real-time performance and accuracy by designing the deformable segmentation attention (DSA) module, a general-purpose architecture based on a deformable attention mechanism and a spatial pyramid architecture. The deformable mechanism accurately extracts feature information from fisheye images, attention learns the global context, and the spatial pyramid structure balances multi-scale feature information, thus improving the perception of fisheye images by traditional networks without an excessive increase in computation. Lightweight networks such as SegNeXt equipped with the DSA module enable effective and rapid multi-scale segmentation of fisheye images in complex scenes. Our architecture achieves outstanding results on the WoodScape dataset, while our ablation experiments demonstrate the effectiveness of various parts of the architecture.

KW - deformable attention

KW - fisheye image segmentation

KW - spatial pyramid

UR - http://www.scopus.com/inward/record.url?scp=85173898431&partnerID=8YFLogxK

U2 - 10.3390/electronics12194059

DO - 10.3390/electronics12194059

M3 - Article

AN - SCOPUS:85173898431

SN - 2079-9292

VL - 12

JO - Electronics (Switzerland)

JF - Electronics (Switzerland)

IS - 19

M1 - 4059

ER -
