Ship detection under complex backgrounds based on accurate rotated anchor boxes from paired semantic segmentation

doi:10.3390/rs11212506

Xiaowu Xiao, Zhiqiang Zhou^*, Bo Wang, Linhao Li, Lingjuan Miao

^* Corresponding author for this work

School of Automation

Beijing Institute of Technology

Research output: Contribution to journal › Article › Peer-reviewed

23 citations (Scopus)

Abstract

Effectively detecting ships in optical remote-sensing images with complex backgrounds remains challenging. Many current CNN-based one-stage and two-stage detection methods first predefine a series of anchors with various scales, aspect ratios and angles, and then output detection results by performing classification and bounding-box regression on the predefined anchors once or twice. However, most of the predefined anchors have relatively low accuracy and are useless for the subsequent classification and regression. In addition, preset anchors are not robust enough to perform well across different detection datasets. To avoid these problems, in this paper we design a paired semantic segmentation network that generates fewer, more accurate rotated anchors. Specifically, the paired segmentation network predicts four parts of each ship (i.e., the top-left, bottom-right, top-right, and bottom-left parts). By combining the paired top-left and bottom-right parts (or the top-right and bottom-left parts), we take the minimum bounding box of the two parts as the rotated anchor. This approach is more robust across different ship datasets, and the generated anchors are more accurate and fewer in number. Furthermore, to effectively use fine-scale detail information and coarse-scale semantic information, we use magnified convolutional features to classify and regress the generated rotated anchors. Meanwhile, the horizontal minimum bounding box of the rotated anchor is also used to incorporate more context information. We compare the proposed algorithm with state-of-the-art object-detection methods for natural images and with ship-detection methods, and demonstrate the superiority of our method.
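
The anchor-generation step described in the abstract can be illustrated with a small geometric sketch. It is not the authors' implementation: it assumes each predicted part is already available as a list of (x, y) pixel coordinates, it reads "minimum bounding box" as the minimum-area enclosing rectangle, and the function names (`convex_hull`, `min_area_rect`, `rotated_anchor`, `horizontal_box`) are hypothetical. The segmentation network itself is not modelled.

```python
import math


def convex_hull(points):
    """Andrew's monotone chain; returns hull vertices in counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts

    def cross(o, a, b):
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]


def min_area_rect(points):
    """Four corners of the minimum-area rectangle enclosing a non-empty point set.

    The minimum-area rectangle has a side collinear with some hull edge,
    so it is enough to try every hull edge direction.
    """
    hull = convex_hull(points)
    best_area, best_corners = None, None
    for i in range(len(hull)):
        (x0, y0), (x1, y1) = hull[i], hull[(i + 1) % len(hull)]
        theta = math.atan2(y1 - y0, x1 - x0)
        c, s = math.cos(theta), math.sin(theta)
        # Rotate the hull by -theta so the current edge is axis-aligned.
        us = [x * c + y * s for x, y in hull]
        vs = [-x * s + y * c for x, y in hull]
        u0, u1, v0, v1 = min(us), max(us), min(vs), max(vs)
        area = (u1 - u0) * (v1 - v0)
        if best_area is None or area < best_area:
            # Rotate the axis-aligned corners back by +theta.
            best_corners = [(u * c - v * s, u * s + v * c)
                            for u, v in ((u0, v0), (u1, v0), (u1, v1), (u0, v1))]
            best_area = area
    return best_corners


def rotated_anchor(part_a, part_b):
    """Assumed pairing step: enclose two paired parts in one rotated rectangle."""
    return min_area_rect(list(part_a) + list(part_b))


def horizontal_box(corners):
    """Axis-aligned box (x_min, y_min, x_max, y_max) around a rotated anchor."""
    xs = [x for x, _ in corners]
    ys = [y for _, y in corners]
    return min(xs), min(ys), max(xs), max(ys)


# Toy example: two halves of a ship-like rectangle rotated by 45 degrees.
top_left_part = [(0, 1), (1, 0), (2.5, 1.5), (1.5, 2.5)]
bottom_right_part = [(1.5, 2.5), (2.5, 1.5), (4, 3), (3, 4)]
anchor = rotated_anchor(top_left_part, bottom_right_part)
context_box = horizontal_box(anchor)
```

In this sketch the horizontal box is derived from the rotated anchor's corners, which mirrors the abstract's use of the horizontal minimum bounding box to bring in surrounding context.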

Original language	English
Article number	2506
Journal	Remote Sensing
Volume	11
Issue number	21
DOI	https://doi.org/10.3390/rs11212506
Publication status	Published - 1 Nov 2019

Cite this

@article{81cd5453939444eb98ed92ff7232d23c,
title = "Ship detection under complex backgrounds based on accurate rotated anchor boxes from paired semantic segmentation",
abstract = "It is still challenging to effectively detect ship objects in optical remote-sensing images with complex backgrounds. Many current CNN-based one-stage and two-stage detection methods usually first predefine a series of anchors with various scales, aspect ratios and angles, and then the detection results can be outputted by performing once or twice classification and bounding box regression for predefined anchors. However, most of the defined anchors have relatively low accuracy, and are useless for the following classification and regression. In addition, the preset anchors are not robust to produce good performance for other different detection datasets. To avoid the above problems, in this paper we design a paired semantic segmentation network to generate more accurate rotated anchors with smaller numbers. Specifically, the paired segmentation network predicts four parts (i.e., top-left, bottom-right, top-right, and bottom-left parts) of ships. By combining paired top-left and bottom-right parts (or top-right and bottom-left parts), we can take the minimum bounding box of these two parts as the rotated anchor. This way can be more robust to different ship datasets, and the generated anchors are more accurate and have fewer numbers. Furthermore, to effectively use fine-scale detail information and coarse-scale semantic information, we use the magnified convolutional features to classify and regress the generated rotated anchors. Meanwhile, the horizontal minimum bounding box of the rotated anchor is also used to combine more context information. We compare the proposed algorithm with state-of-the-art object-detection methods for natural images and ship-detection methods, and demonstrate the superiority of our method.",

keywords = "Context information, Convolutional neural networks, Magnified convolutional features, Paired semantic segmentation, Ship detection",

author = "Xiaowu Xiao and Zhiqiang Zhou and Bo Wang and Linhao Li and Lingjuan Miao",

note = "Publisher Copyright: {\textcopyright} 2019 by the authors.",

year = "2019",

month = nov,

day = "1",

doi = "10.3390/rs11212506",

language = "English",

volume = "11",

journal = "Remote Sensing",

issn = "2072-4292",

publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",

number = "21",

}

TY - JOUR
T1 - Ship detection under complex backgrounds based on accurate rotated anchor boxes from paired semantic segmentation
AU - Xiao, Xiaowu
AU - Zhou, Zhiqiang
AU - Wang, Bo
AU - Li, Linhao
AU - Miao, Lingjuan
PY - 2019/11/1
Y1 - 2019/11/1
AB - It is still challenging to effectively detect ship objects in optical remote-sensing images with complex backgrounds. Many current CNN-based one-stage and two-stage detection methods usually first predefine a series of anchors with various scales, aspect ratios and angles, and then the detection results can be outputted by performing once or twice classification and bounding box regression for predefined anchors. However, most of the defined anchors have relatively low accuracy, and are useless for the following classification and regression. In addition, the preset anchors are not robust to produce good performance for other different detection datasets. To avoid the above problems, in this paper we design a paired semantic segmentation network to generate more accurate rotated anchors with smaller numbers. Specifically, the paired segmentation network predicts four parts (i.e., top-left, bottom-right, top-right, and bottom-left parts) of ships. By combining paired top-left and bottom-right parts (or top-right and bottom-left parts), we can take the minimum bounding box of these two parts as the rotated anchor. This way can be more robust to different ship datasets, and the generated anchors are more accurate and have fewer numbers. Furthermore, to effectively use fine-scale detail information and coarse-scale semantic information, we use the magnified convolutional features to classify and regress the generated rotated anchors. Meanwhile, the horizontal minimum bounding box of the rotated anchor is also used to combine more context information. We compare the proposed algorithm with state-of-the-art object-detection methods for natural images and ship-detection methods, and demonstrate the superiority of our method.
KW - Context information
KW - Convolutional neural networks
KW - Magnified convolutional features
KW - Paired semantic segmentation
KW - Ship detection
UR - http://www.scopus.com/inward/record.url?scp=85074672095&partnerID=8YFLogxK
U2 - 10.3390/rs11212506
DO - 10.3390/rs11212506
M3 - Article
AN - SCOPUS:85074672095
SN - 2072-4292
VL - 11
JO - Remote Sensing
JF - Remote Sensing
IS - 21
M1 - 2506
ER -
