Multiple Kernel Geometric Mean Metric Learning for Heterogeneous Data

  • Ren Qi
  • , Peng Fei Zhu*
  • , Jian Qing Liang
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

How to choose a proper distance metric is vital to many machine learning and pattern recognition tasks. Metric learning mainly uses discriminant information to learn a Mahalanobis distance or similarity metric. However, most existing metric learning methods are for numerical data, and it is unreasonable to calculate the similarity between two heterogeneous objects (e.g., categorical data) using traditional distance metrics. Besides, they suffer from curse of dimensionality, resulting in poor efficiency and scalability when the feature dimension is very high. In this paper, a geometric mean metric learning method is proposed for heterogeneous data. The numerical data and categorical data are mapped to a reproducing kernel Hilbert space by using different kernel functions, thus avoiding the negative influence of the high dimensionality of the feature. At the same time, a multiple kernel metric learning model based on geometric mean is introduced to transform the metric learning problem of heterogeneous data into solving the midpoint between two points on the Riemannian manifold. Experiments on benchmark UCI datasets show that the presented method shows promising performances in terms of accuracy in comparison with the state-of-the-art metric learning methods.

Original languageEnglish
Pages (from-to)2992-3001
Number of pages10
JournalRuan Jian Xue Bao/Journal of Software
Volume28
Issue number11
DOIs
Publication statusPublished - 1 Nov 2017
Externally publishedYes

Keywords

  • Geometric mean
  • Heterogeneous data
  • Metric learning
  • Multi-kernel learning

Fingerprint

Dive into the research topics of 'Multiple Kernel Geometric Mean Metric Learning for Heterogeneous Data'. Together they form a unique fingerprint.

Cite this