TY - JOUR
T1 - Deep Scattering Network With Fractional Wavelet Transform
AU - Shi, Jun
AU - Zhao, Yanan
AU - Xiang, Wei
AU - Monga, Vishal
AU - Liu, Xiaoping
AU - Tao, Ran
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - Deep convolutional neural networks (DCNNs) have recently emerged as a powerful tool to deliver breakthrough performances in various image analysis and processing applications. However, DCNNs lack a strong theoretical foundation and require massive amounts of training data. More recently, the deep scattering network (DSN), a variant of DCNNs, has been proposed to address these issues. DSNs inherit the hierarchical structure of DCNNs, but replace data-driven linear filters with predefined fixed multi-scale wavelet filters, which facilitate an in-depth understanding of DCNNs and also offer the state-of-the-art performance in image classification. Unfortunately, DSNs suffer from a major drawback: they are suitable for stationary image textures but not non-stationary image textures, since 2D wavelets are intrinsically linear translation-invariant filters in the Fourier transform domain. The objective of this paper is to overcome this drawback using the fractional wavelet transform (FRWT) which can be viewed as a bank of linear translation-variant multi-scale filters and thus may be well suited for non-stationary texture analysis. We first propose the fractional wavelet scattering transform (FRWST) based upon the FRWT. Then, we present a generalized structure for the DSN by cascading fractional wavelet convolutions and modulus operators. Basic properties of this generalized DSN are derived, followed by a fast implementation of the generalized DSN as well as their practical applications. The theoretical derivations are validated via computer simulations.
AB - Deep convolutional neural networks (DCNNs) have recently emerged as a powerful tool to deliver breakthrough performances in various image analysis and processing applications. However, DCNNs lack a strong theoretical foundation and require massive amounts of training data. More recently, the deep scattering network (DSN), a variant of DCNNs, has been proposed to address these issues. DSNs inherit the hierarchical structure of DCNNs, but replace data-driven linear filters with predefined fixed multi-scale wavelet filters, which facilitate an in-depth understanding of DCNNs and also offer the state-of-the-art performance in image classification. Unfortunately, DSNs suffer from a major drawback: they are suitable for stationary image textures but not non-stationary image textures, since 2D wavelets are intrinsically linear translation-invariant filters in the Fourier transform domain. The objective of this paper is to overcome this drawback using the fractional wavelet transform (FRWT) which can be viewed as a bank of linear translation-variant multi-scale filters and thus may be well suited for non-stationary texture analysis. We first propose the fractional wavelet scattering transform (FRWST) based upon the FRWT. Then, we present a generalized structure for the DSN by cascading fractional wavelet convolutions and modulus operators. Basic properties of this generalized DSN are derived, followed by a fast implementation of the generalized DSN as well as their practical applications. The theoretical derivations are validated via computer simulations.
KW - Deep convolutional neural networks
KW - fractional wavelet transform
KW - translation-variant filtering
KW - wavelet scattering
UR - http://www.scopus.com/inward/record.url?scp=85112652169&partnerID=8YFLogxK
U2 - 10.1109/TSP.2021.3098936
DO - 10.1109/TSP.2021.3098936
M3 - Article
AN - SCOPUS:85112652169
SN - 1053-587X
VL - 69
SP - 4740
EP - 4757
JO - IEEE Transactions on Signal Processing
JF - IEEE Transactions on Signal Processing
ER -