Hilbert Curve Projection Distance for Distribution Comparison

Tao Li, Cheng Meng, Hongteng Xu*, Jun Yu

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 2
  • Captures
    • Readers: 9
see details

Abstract

Distribution comparison plays a central role in many machine learning tasks like data classification and generative mod- eling. In this study, we propose a novel metric, called Hilbert curve projection (HCP) distance, to measure the distance between two probability distributions with low complexity. In particular, we first project two high-dimensional probability distributions using Hilbert curve to obtain a coupling between them, and then cal- culate the transport distance between these two distributions in the original space, according to the coupling. We show that HCP distance is a proper metric and is well-defined for probability measures with bounded supports. Furthermore, we demonstrate that the modified empirical HCP distance with the Lp cost in the d-dimensional space converges to its population counterpart at a rate of no more than O(n−1/2 max{d,p}). To suppress the curse-of-dimensionality, we also develop two variants of the HCP distance using (learnable) subspace projections. Experiments on both synthetic and real-world data show that our HCP distance works as an effective surrogate of the Wasserstein distance with low complexity and overcomes the drawbacks of the sliced Wasserstein distance.

Original languageEnglish
Pages (from-to)4993-5007
Number of pages15
JournalIEEE Transactions on Pattern Analysis and Machine Intelligence
Volume46
Issue number7
DOIs
Publication statusPublished - 1 Jul 2024

Keywords

  • Distribution comparison
  • Hilbert curve
  • Wasserstein distance
  • optimal transport
  • projection robust Wasserstein distance

Fingerprint

Dive into the research topics of 'Hilbert Curve Projection Distance for Distribution Comparison'. Together they form a unique fingerprint.

Cite this

Li, T., Meng, C., Xu, H., & Yu, J. (2024). Hilbert Curve Projection Distance for Distribution Comparison. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46(7), 4993-5007. https://doi.org/10.1109/TPAMI.2024.3363780