高维数据非参数密度估计的低维流形代表点法

Shuliang Wang, Ying Li, Jing Geng*

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

1 引用 (Scopus)

摘要

When learning from high-dimensional sample data in big data, the non-parametric kernel method uses a unified metric, which is prone to dimensional disasters. If the low-dimensional geometric characteristics embedded in the high-dimensional space are found, it is helpful to reveal the manifold structure of the data distribution, and the high-dimensional data with limited samples can be used to approximate the true distribution of the data in the low-dimensional subspace. Based on this, this paper proposes a new low-dimensional manifold representative point method for non-parametric density estimation of high-dimensional data, which estimates the density by mining the geometric structure of the data distribution from the high-dimensional space. First, the local covariance matrix is calculated and the local data distribution is characterized by looking for points in the local area that can represent the main direction of the manifold structure. Then, each sample data point contribution is weight to density considering the different effects of the data points on or near the manifold structure. The experimental results show that, compared with the traditional kernel density estimation method and the manifold kernel density method, our proposed method can quickly and robustly perform density estimation and reflect the true distribution of data.

投稿的翻译标题A Low-Dimensional Manifold Representative Point Method to Estimate the Non-parametric Density for High-Dimensional Data
源语言繁体中文
页(从-至)65-70
页数6
期刊Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University
46
1
DOI
出版状态已出版 - 5 1月 2021

关键词

  • Cross-validated likelihood
  • High-dimensional data
  • Kernel density estimation
  • Low-dimensional manifold representative point method
  • Non-parametric density estimation

指纹

探究 '高维数据非参数密度估计的低维流形代表点法' 的科研主题。它们共同构成独一无二的指纹。

引用此