Realize Generative Yet Complete Latent Representation for Incomplete Multi-View Learning

Hongmin Cai; Weitian Huang; Sirui Yang; Siqi Ding; Yue Zhang; Bin Hu; Fa Zhang; Yiu Ming Cheung

doi:10.1109/TPAMI.2023.3346869

Realize Generative Yet Complete Latent Representation for Incomplete Multi-View Learning

Hongmin Cai, Weitian Huang, Sirui Yang, Siqi Ding, Yue Zhang, Bin Hu, Fa Zhang, Yiu Ming Cheung^*

^*Corresponding author for this work

School of Medical and Technology

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

In multi-view environment, it would yield missing observations due to the limitation of the observation process. The most current representation learning methods struggle to explore complete information by lacking either cross-generative via simply filling in missing view data, or solidative via inferring a consistent representation among the existing views. To address this problem, we propose a deep generative model to learn a complete generative latent representation, namely Complete Multi-view Variational Auto-Encoders (CMVAE), which models the generation of the multiple views from a complete latent variable represented by a mixture of Gaussian distributions. Thus, the missing view can be fully characterized by the latent variables and is resolved by estimating its posterior distribution. Accordingly, a novel variational lower bound is introduced to integrate view-invariant information into posterior inference to enhance the solidative of the learned latent representation. The intrinsic correlations between views are mined to seek cross-view generality, and information leading to missing views is fused by view weights to reach solidity. Benchmark experimental results in clustering, classification, and cross-view image generation tasks demonstrate the superiority of CMVAE, while time complexity and parameter sensitivity analyses illustrate the efficiency and robustness. Additionally, application to bioinformatics data exemplifies its practical significance.

Original language	English
Article number	10373887
Pages (from-to)	3637-3652
Number of pages	16
Journal	IEEE Transactions on Pattern Analysis and Machine Intelligence
Volume	46
Issue number	5
DOIs	https://doi.org/10.1109/TPAMI.2023.3346869
Publication status	Published - 1 May 2024

Keywords

Deep generative models
incomplete multi-view problem
multi-view learning
representation learning

Access to Document

10.1109/TPAMI.2023.3346869

Cite this

Cai, H., Huang, W., Yang, S., Ding, S., Zhang, Y., Hu, B., Zhang, F., & Cheung, Y. M. (2024). Realize Generative Yet Complete Latent Representation for Incomplete Multi-View Learning. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46(5), 3637-3652. Article 10373887. https://doi.org/10.1109/TPAMI.2023.3346869

@article{91e267862cdd429ea0af6928ce8c5475,

title = "Realize Generative Yet Complete Latent Representation for Incomplete Multi-View Learning",

abstract = "In multi-view environment, it would yield missing observations due to the limitation of the observation process. The most current representation learning methods struggle to explore complete information by lacking either cross-generative via simply filling in missing view data, or solidative via inferring a consistent representation among the existing views. To address this problem, we propose a deep generative model to learn a complete generative latent representation, namely Complete Multi-view Variational Auto-Encoders (CMVAE), which models the generation of the multiple views from a complete latent variable represented by a mixture of Gaussian distributions. Thus, the missing view can be fully characterized by the latent variables and is resolved by estimating its posterior distribution. Accordingly, a novel variational lower bound is introduced to integrate view-invariant information into posterior inference to enhance the solidative of the learned latent representation. The intrinsic correlations between views are mined to seek cross-view generality, and information leading to missing views is fused by view weights to reach solidity. Benchmark experimental results in clustering, classification, and cross-view image generation tasks demonstrate the superiority of CMVAE, while time complexity and parameter sensitivity analyses illustrate the efficiency and robustness. Additionally, application to bioinformatics data exemplifies its practical significance.",

keywords = "Deep generative models, incomplete multi-view problem, multi-view learning, representation learning",

author = "Hongmin Cai and Weitian Huang and Sirui Yang and Siqi Ding and Yue Zhang and Bin Hu and Fa Zhang and Cheung, {Yiu Ming}",

note = "Publisher Copyright: {\textcopyright} 1979-2012 IEEE.",

year = "2024",

month = may,

day = "1",

doi = "10.1109/TPAMI.2023.3346869",

language = "English",

volume = "46",

pages = "3637--3652",

journal = "IEEE Transactions on Pattern Analysis and Machine Intelligence",

issn = "0162-8828",

publisher = "IEEE Computer Society",

number = "5",

}

TY - JOUR

T1 - Realize Generative Yet Complete Latent Representation for Incomplete Multi-View Learning

AU - Cai, Hongmin

AU - Huang, Weitian

AU - Yang, Sirui

AU - Ding, Siqi

AU - Zhang, Yue

AU - Hu, Bin

AU - Zhang, Fa

AU - Cheung, Yiu Ming

PY - 2024/5/1

Y1 - 2024/5/1

N2 - In multi-view environment, it would yield missing observations due to the limitation of the observation process. The most current representation learning methods struggle to explore complete information by lacking either cross-generative via simply filling in missing view data, or solidative via inferring a consistent representation among the existing views. To address this problem, we propose a deep generative model to learn a complete generative latent representation, namely Complete Multi-view Variational Auto-Encoders (CMVAE), which models the generation of the multiple views from a complete latent variable represented by a mixture of Gaussian distributions. Thus, the missing view can be fully characterized by the latent variables and is resolved by estimating its posterior distribution. Accordingly, a novel variational lower bound is introduced to integrate view-invariant information into posterior inference to enhance the solidative of the learned latent representation. The intrinsic correlations between views are mined to seek cross-view generality, and information leading to missing views is fused by view weights to reach solidity. Benchmark experimental results in clustering, classification, and cross-view image generation tasks demonstrate the superiority of CMVAE, while time complexity and parameter sensitivity analyses illustrate the efficiency and robustness. Additionally, application to bioinformatics data exemplifies its practical significance.

AB - In multi-view environment, it would yield missing observations due to the limitation of the observation process. The most current representation learning methods struggle to explore complete information by lacking either cross-generative via simply filling in missing view data, or solidative via inferring a consistent representation among the existing views. To address this problem, we propose a deep generative model to learn a complete generative latent representation, namely Complete Multi-view Variational Auto-Encoders (CMVAE), which models the generation of the multiple views from a complete latent variable represented by a mixture of Gaussian distributions. Thus, the missing view can be fully characterized by the latent variables and is resolved by estimating its posterior distribution. Accordingly, a novel variational lower bound is introduced to integrate view-invariant information into posterior inference to enhance the solidative of the learned latent representation. The intrinsic correlations between views are mined to seek cross-view generality, and information leading to missing views is fused by view weights to reach solidity. Benchmark experimental results in clustering, classification, and cross-view image generation tasks demonstrate the superiority of CMVAE, while time complexity and parameter sensitivity analyses illustrate the efficiency and robustness. Additionally, application to bioinformatics data exemplifies its practical significance.

KW - Deep generative models

KW - incomplete multi-view problem

KW - multi-view learning

KW - representation learning

UR - http://www.scopus.com/inward/record.url?scp=85181573597&partnerID=8YFLogxK

U2 - 10.1109/TPAMI.2023.3346869

DO - 10.1109/TPAMI.2023.3346869

M3 - Article

C2 - 38145535

AN - SCOPUS:85181573597

SN - 0162-8828

VL - 46

SP - 3637

EP - 3652

JO - IEEE Transactions on Pattern Analysis and Machine Intelligence

JF - IEEE Transactions on Pattern Analysis and Machine Intelligence

IS - 5

M1 - 10373887

ER -

Realize Generative Yet Complete Latent Representation for Incomplete Multi-View Learning

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this