Skip to main navigation Skip to search Skip to main content

An Optimal-Transport-Based Multimodal Big Data Clustering

  • Zheng Yang*
  • , Chongyang Shi
  • , Ying Guan
  • *Corresponding author for this work
  • Shenyang Institute of Engineering

Research output: Contribution to journalArticlepeer-review

Abstract

Multimodal clustering achieves outstanding performance in various applications by aggregating information from heterogeneous devices. However, previous methods rely on strong-notion distances to fuse crossmodal complementary knowledge, established on a fragile assumption about the existence of a ubiquitous non-negligible intersection between heterogeneous manifolds of modalities. Due to this unstable theoretical basis, previous methods are essentially challenged by limited performance on general multimodal data. To address this challenge, an optimal-transport-based multimodal clustering (OTMC) method is defined as the optimal transport (OT) from multimodal data distributions to clustering distributions, which leverages a weak-topology measure to capture complementary knowledge with clear discriminative structures. OTMC consists of a modality-specific OT delivering private structures and a modality-common OT delivering shared structures, which transports category structures scattered in manifolds of each modality and all modalities to common prototypes, respectively. Furthermore, variational solutions to OTMC are derived by matching the data-prototype joint distribution, which induces the multimodal OT clustering network, to capture discriminative structures. Finally, the experimental results from four real-world datasets demonstrate the superiority of OTMC, helped by never relying on the phantom of heterogeneous manifold intersections. In particular, OTMC obtains 92.15% ACC, 84.96% NMI, and 83.35% ARI on Handwritten, improving by 2.25%, 2.82%, and 3.28%, respectively.

Original languageEnglish
Article number666
JournalElectronics (Switzerland)
Volume14
Issue number4
DOIs
Publication statusPublished - Feb 2025

Keywords

  • big data
  • deep multimodal clustering
  • heterogeneous manifold
  • multimodal data
  • optimal transport
  • variational solution

Fingerprint

Dive into the research topics of 'An Optimal-Transport-Based Multimodal Big Data Clustering'. Together they form a unique fingerprint.

Cite this