TY - JOUR
T1 - Realize Generative Yet Complete Latent Representation for Incomplete Multi-View Learning
AU - Cai, Hongmin
AU - Huang, Weitian
AU - Yang, Sirui
AU - Ding, Siqi
AU - Zhang, Yue
AU - Hu, Bin
AU - Zhang, Fa
AU - Cheung, Yiu Ming
N1 - Publisher Copyright:
© 1979-2012 IEEE.
PY - 2024/5/1
Y1 - 2024/5/1
N2 - In multi-view environment, it would yield missing observations due to the limitation of the observation process. The most current representation learning methods struggle to explore complete information by lacking either cross-generative via simply filling in missing view data, or solidative via inferring a consistent representation among the existing views. To address this problem, we propose a deep generative model to learn a complete generative latent representation, namely Complete Multi-view Variational Auto-Encoders (CMVAE), which models the generation of the multiple views from a complete latent variable represented by a mixture of Gaussian distributions. Thus, the missing view can be fully characterized by the latent variables and is resolved by estimating its posterior distribution. Accordingly, a novel variational lower bound is introduced to integrate view-invariant information into posterior inference to enhance the solidative of the learned latent representation. The intrinsic correlations between views are mined to seek cross-view generality, and information leading to missing views is fused by view weights to reach solidity. Benchmark experimental results in clustering, classification, and cross-view image generation tasks demonstrate the superiority of CMVAE, while time complexity and parameter sensitivity analyses illustrate the efficiency and robustness. Additionally, application to bioinformatics data exemplifies its practical significance.
AB - In multi-view environment, it would yield missing observations due to the limitation of the observation process. The most current representation learning methods struggle to explore complete information by lacking either cross-generative via simply filling in missing view data, or solidative via inferring a consistent representation among the existing views. To address this problem, we propose a deep generative model to learn a complete generative latent representation, namely Complete Multi-view Variational Auto-Encoders (CMVAE), which models the generation of the multiple views from a complete latent variable represented by a mixture of Gaussian distributions. Thus, the missing view can be fully characterized by the latent variables and is resolved by estimating its posterior distribution. Accordingly, a novel variational lower bound is introduced to integrate view-invariant information into posterior inference to enhance the solidative of the learned latent representation. The intrinsic correlations between views are mined to seek cross-view generality, and information leading to missing views is fused by view weights to reach solidity. Benchmark experimental results in clustering, classification, and cross-view image generation tasks demonstrate the superiority of CMVAE, while time complexity and parameter sensitivity analyses illustrate the efficiency and robustness. Additionally, application to bioinformatics data exemplifies its practical significance.
KW - Deep generative models
KW - incomplete multi-view problem
KW - multi-view learning
KW - representation learning
UR - http://www.scopus.com/inward/record.url?scp=85181573597&partnerID=8YFLogxK
U2 - 10.1109/TPAMI.2023.3346869
DO - 10.1109/TPAMI.2023.3346869
M3 - Article
C2 - 38145535
AN - SCOPUS:85181573597
SN - 0162-8828
VL - 46
SP - 3637
EP - 3652
JO - IEEE Transactions on Pattern Analysis and Machine Intelligence
JF - IEEE Transactions on Pattern Analysis and Machine Intelligence
IS - 5
M1 - 10373887
ER -