TY - JOUR
T1 - Deep Contrastive Principal Component Analysis Adaptive to Nonlinear Data
AU - Cao, Hongjie
AU - Wang, Gang
AU - Sun, Jian
AU - Deng, Fang
AU - Chen, Jie
N1 - Publisher Copyright:
© 1991-2012 IEEE.
PY - 2022
Y1 - 2022
N2 - Principal component analysis (PCA) is one of the most fundamental techniques for Big Data analytics in e.g, smart manufacturing and biostatistics, which is capable of extracting the most essential information from a dataset. However, when it encounters multiple datasets, PCA cannot reveal the specific inherent data structure hidden in one dataset relative to the other(s), which we term contrastive data analytics in this paper. Although a number of proposals such as contrastive or discriminative PCA have been suggested, they require fine-tuning of hyper-parameters or cannot effectively deal with nonlinear data. In this context, we advocate deep contrastive (Dc) PCA for nonlinear contrastive data analytics, which leverages the power of deep neural networks to explore the hidden nonlinear relationships in the datasets and extract the desired contrastive features. An alternating minimization algorithm is developed for simultaneously seeking the best nonlinear transformations for the data as well as the associated contrastive projections, tantamount to performing an eigenvalue decomposition and a back-propagation step. Substantial numerical tests using both synthetic and real datasets from protein expression, electroencephalography, wearable human motion, and smart manufacturing, are conducted, which corroborate the superior adaptivity of DcPCA in dealing with nonlinear data relative to state-of-the-art alternatives.
AB - Principal component analysis (PCA) is one of the most fundamental techniques for Big Data analytics in e.g, smart manufacturing and biostatistics, which is capable of extracting the most essential information from a dataset. However, when it encounters multiple datasets, PCA cannot reveal the specific inherent data structure hidden in one dataset relative to the other(s), which we term contrastive data analytics in this paper. Although a number of proposals such as contrastive or discriminative PCA have been suggested, they require fine-tuning of hyper-parameters or cannot effectively deal with nonlinear data. In this context, we advocate deep contrastive (Dc) PCA for nonlinear contrastive data analytics, which leverages the power of deep neural networks to explore the hidden nonlinear relationships in the datasets and extract the desired contrastive features. An alternating minimization algorithm is developed for simultaneously seeking the best nonlinear transformations for the data as well as the associated contrastive projections, tantamount to performing an eigenvalue decomposition and a back-propagation step. Substantial numerical tests using both synthetic and real datasets from protein expression, electroencephalography, wearable human motion, and smart manufacturing, are conducted, which corroborate the superior adaptivity of DcPCA in dealing with nonlinear data relative to state-of-the-art alternatives.
KW - Nonlinear dimensionality reduction
KW - contrastive analytics
KW - contrastive learning
KW - deep neural network
UR - http://www.scopus.com/inward/record.url?scp=85144074820&partnerID=8YFLogxK
U2 - 10.1109/TSP.2022.3224647
DO - 10.1109/TSP.2022.3224647
M3 - Article
AN - SCOPUS:85144074820
SN - 1053-587X
VL - 70
SP - 5738
EP - 5750
JO - IEEE Transactions on Signal Processing
JF - IEEE Transactions on Signal Processing
ER -