Detecting Three-Dimensional Associations in Large Data Set

LIU Chuanlu*, WANG Shuliang*, YUAN Hanning, GENG Jing

*Corresponding author for this work

Research output: Contribution to journal, Article, peer-reviewed

1 Citation (Scopus)

Abstract

Detecting associations among variables in large datasets has recently become important because of the rapid growth of data. Associations of interest can inform problems such as dimension reduction and feature selection. Many methods have been developed for detecting associations between pairs of variables, but multi-dimensional variables, especially three-dimensional variables, are rarely studied, and the relationships among them cannot be revealed by pairwise detection methods. A new method, the Maximal three-dimensional information coefficient (MTDIC), is proposed to indicate the associations of three-dimensional variables. The correlation coefficient is calculated from the three-dimensional mutual information. World Health Organization (WHO) data and the Tara data are selected to evaluate their associations. The experiment is verified by comparing the coefficient results with those of Distance correlation (Dcor). The accurate association strength is obtained by an iterative optimization procedure applied to the coefficients sorted in descending order. The MTDIC performs better than the Dcor in generality and equitability properties.
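To make the idea of a coefficient derived from three-dimensional mutual information concrete, the following is a minimal sketch and not the authors' MTDIC procedure. It assumes one common three-variable generalization of mutual information, the total correlation H(X) + H(Y) + H(Z) - H(X, Y, Z), estimated on a fixed equal-width grid. The function names, the choice of `bins`, and the normalization by 2 log(bins) are hypothetical choices made for illustration; the abstract does not specify the grid search or the normalization used in the paper.

```python
import numpy as np


def total_correlation(x, y, z, bins=4):
    """Plug-in estimate (in nats) of H(X) + H(Y) + H(Z) - H(X, Y, Z).

    Assumption: an equal-width grid with `bins` cells per axis.
    """
    data = np.column_stack([x, y, z])
    counts, _ = np.histogramdd(data, bins=bins)
    p = counts / counts.sum()  # empirical joint distribution on the grid

    def entropy(q):
        q = q[q > 0]  # empty cells contribute nothing to the entropy
        return -np.sum(q * np.log(q))

    hx = entropy(p.sum(axis=(1, 2)))  # marginal of X
    hy = entropy(p.sum(axis=(0, 2)))  # marginal of Y
    hz = entropy(p.sum(axis=(0, 1)))  # marginal of Z
    hxyz = entropy(p.ravel())         # joint entropy
    return hx + hy + hz - hxyz


def normalized_score(x, y, z, bins=4):
    """Hypothetical normalization to [0, 1].

    The total correlation of three variables with `bins` cells each
    cannot exceed 2 * log(bins), so that value is used as the divisor.
    """
    return total_correlation(x, y, z, bins) / (2.0 * np.log(bins))


rng = np.random.default_rng(0)
n = 5000
u = rng.random(n)

# Three identical variables: the score should be close to 1.
score_dependent = normalized_score(u, u, u)

# Three independent variables: the score should be close to 0.
score_independent = normalized_score(rng.random(n), rng.random(n), rng.random(n))

print(score_dependent, score_independent)
```

A fixed grid keeps the sketch short. A maximal-information-style coefficient would instead search over many grid resolutions and keep the largest normalized value, which this example does not attempt.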

Original language: English
Pages (from-to): 1131-1140
Number of pages: 10
Journal: Chinese Journal of Electronics
Volume: 30
Issue number: 6
Publication status: Published - Nov 2021
