Abstract
Distance-based regression model has become a powerful approach to identifying phenotypic associations in many fields. It is found to be particularly useful for high-dimensional biological and genetic data with proper distance or similarity measures being available. The pseudo F statistic used in this model accumulates information and is effective when the signals, that is the variations represented by the eigenvalues of the similarity matrix, scatter evenly along the eigenvectors of the similarity matrix. However, it might lose power for the uneven signals. To deal with this issue, we propose a group analysis on the variations of signals along the eigenvalues of the similarity matrix and take the maximum among them. The new procedure can automatically choose an optimal grouping point on some given thresholds and thus can improve the power evidence. Extensive computer simulations and applications to a prostate cancer data and an aging human brain data illustrate the effectiveness of the proposed method.
Original language | English |
---|---|
Pages (from-to) | 620-628 |
Number of pages | 9 |
Journal | Genetic Epidemiology |
Volume | 44 |
Issue number | 6 |
DOIs | |
Publication status | Published - 1 Sept 2020 |
Externally published | Yes |
Keywords
- distance-based regression
- eigenvalue decomposition
- pseudo F test statistic