Group analysis of distance matrices

Jinjuan Wang, Jialu Li, Wenjun Xiong, Qizhai Li*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

5 Citations (Scopus)

Abstract

Distance-based regression model has become a powerful approach to identifying phenotypic associations in many fields. It is found to be particularly useful for high-dimensional biological and genetic data with proper distance or similarity measures being available. The pseudo F statistic used in this model accumulates information and is effective when the signals, that is the variations represented by the eigenvalues of the similarity matrix, scatter evenly along the eigenvectors of the similarity matrix. However, it might lose power for the uneven signals. To deal with this issue, we propose a group analysis on the variations of signals along the eigenvalues of the similarity matrix and take the maximum among them. The new procedure can automatically choose an optimal grouping point on some given thresholds and thus can improve the power evidence. Extensive computer simulations and applications to a prostate cancer data and an aging human brain data illustrate the effectiveness of the proposed method.

Original languageEnglish
Pages (from-to)620-628
Number of pages9
JournalGenetic Epidemiology
Volume44
Issue number6
DOIs
Publication statusPublished - 1 Sept 2020
Externally publishedYes

Keywords

  • distance-based regression
  • eigenvalue decomposition
  • pseudo F test statistic

Fingerprint

Dive into the research topics of 'Group analysis of distance matrices'. Together they form a unique fingerprint.

Cite this