TY - JOUR
T1 - A Multistage Refinement Network for Salient Object Detection
AU - Zhang, Lihe
AU - Wu, Jie
AU - Wang, Tiantian
AU - Borji, Ali
AU - Wei, Guohua
AU - Lu, Huchuan
N1 - Publisher Copyright:
© 1992-2012 IEEE.
PY - 2020
Y1 - 2020
N2 - Deep convolutional neural networks (CNNs) have been successfully applied to a wide variety of problems in computer vision, including salient object detection. To accurately detect and segment salient objects, it is necessary to extract and combine high-level semantic features with low-level fine details simultaneously. This is challenging for CNNs because repeated subsampling operations such as pooling and convolution lead to a significant decrease in the feature resolution, which results in the loss of spatial details and finer structures. Therefore, we propose augmenting feedforward neural networks by using the multistage refinement mechanism. In the first stage, a master net is built to generate a coarse prediction map in which most detailed structures are missing. In the following stages, the refinement net with layerwise recurrent connections to the master net is equipped to progressively combine local context information across stages to refine the preceding saliency maps in a stagewise manner. Furthermore, the pyramid pooling module and channel attention module are applied to aggregate different-region-based global contexts. Extensive evaluations over six benchmark datasets show that the proposed method performs favorably against the state-of-the-art approaches.
AB - Deep convolutional neural networks (CNNs) have been successfully applied to a wide variety of problems in computer vision, including salient object detection. To accurately detect and segment salient objects, it is necessary to extract and combine high-level semantic features with low-level fine details simultaneously. This is challenging for CNNs because repeated subsampling operations such as pooling and convolution lead to a significant decrease in the feature resolution, which results in the loss of spatial details and finer structures. Therefore, we propose augmenting feedforward neural networks by using the multistage refinement mechanism. In the first stage, a master net is built to generate a coarse prediction map in which most detailed structures are missing. In the following stages, the refinement net with layerwise recurrent connections to the master net is equipped to progressively combine local context information across stages to refine the preceding saliency maps in a stagewise manner. Furthermore, the pyramid pooling module and channel attention module are applied to aggregate different-region-based global contexts. Extensive evaluations over six benchmark datasets show that the proposed method performs favorably against the state-of-the-art approaches.
KW - Salient object detection
KW - channel attention
KW - global average pooling
KW - layerwise recurrent
KW - pyramid pooling module
KW - stagewise refinement
UR - http://www.scopus.com/inward/record.url?scp=85079632276&partnerID=8YFLogxK
U2 - 10.1109/TIP.2019.2962688
DO - 10.1109/TIP.2019.2962688
M3 - Article
AN - SCOPUS:85079632276
SN - 1057-7149
VL - 29
SP - 3534
EP - 3545
JO - IEEE Transactions on Image Processing
JF - IEEE Transactions on Image Processing
M1 - 8949760
ER -