Detecting Three-Dimensional Associations in Large Data Set

L. I.U. Chuanlu*, W. A.N.G. Shuliang*, Y. U.A.N. Hanning, G. E.N.G. Jing

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

The associations detection among variables in the large dataset is recently important due to the rapid growth rate of data. The interested associations can provide references for solving the problems such as dimension reduction and feature selection. Many methods have done on the associations detection of pairwise variables. The multi-dimensional variables, especially three-dimensional variables, is rarely studied. The relationships among them cannot be revealed by the detection of pairwise variables methods. A new method of Maximal three-dimensional information coefficient (MTDIC) is proposed which is able to indicate the associations of three-dimensional variables. The correlation coefficient is calculated from the three-dimensional mutual information. The World Health Organization (WHO) data and the Tara data are selected to evaluate their associations. The experiment is verified by comparing the coefficient results with the Distance correlation (Dcor). The accurate association strength is obtained by an iterative optimization procedure on sorting descending order of coefficients. The MTDIC performs better than the Dcor in generality and equitability properties.

Original languageEnglish
Pages (from-to)1131-1140
Number of pages10
JournalChinese Journal of Electronics
Volume30
Issue number6
DOIs
Publication statusPublished - Nov 2021

Keywords

  • Associations
  • Iterative optimization
  • Mutual information
  • Three-dimensional variables

Fingerprint

Dive into the research topics of 'Detecting Three-Dimensional Associations in Large Data Set'. Together they form a unique fingerprint.

Cite this