Generalising combinatorial discriminant analysis through conditioning truncated Rayleigh flow

  • Sijia Yang
  • , Haoyi Xiong
  • , Di Hu
  • , Kaibo Xu
  • , Licheng Wang*
  • , Peizhen Zhu
  • , Zeyi Sun*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Fisher’s Linear Discriminant Analysis (LDA) has been widely used for linear classification, feature selection, and metrics learning in multivariate data analytics. To ensure high classification accuracy while optimally discovering predictive features from the data, this paper studied CDA, namely Combinatorial Discriminant Analysis that intends to combinatorially select a subset of features and assign weights to them optimally. CDA extents the Truncated Rayleigh Flow algorithm (Tan et al. in J R Stat Soc: Ser B (Stat Methodol) 80(5):1057–1086, 2018) and improves LDA estimation under k-sparsity constraint. The experimental results based on the synthesized and real-world datasets demonstrate that our algorithm outperforms other LDA baselines and downstream classifiers. The empirical analysis shows that our algorithm can recover the combinatorial structure of optimal LDA with empirical consistency.

Original languageEnglish
Pages (from-to)2189-2208
Number of pages20
JournalKnowledge and Information Systems
Volume63
Issue number8
DOIs
Publication statusPublished - Aug 2021
Externally publishedYes

Fingerprint

Dive into the research topics of 'Generalising combinatorial discriminant analysis through conditioning truncated Rayleigh flow'. Together they form a unique fingerprint.

Cite this