跳到主要导航 跳到搜索 跳到主要内容

Feature Selection for Optimized High-Dimensional Biomedical Data Using an Improved Shuffled Frog Leaping Algorithm

  • Bin Hu
  • , Yongqiang Dai
  • , Yun Su
  • , Philip Moore
  • , Xiaowei Zhang
  • , Chengsheng Mao
  • , Jing Chen
  • , Lixin Xu

科研成果: 期刊稿件文章同行评审

摘要

High dimensional biomedical datasets contain thousands of features which can be used in molecular diagnosis of disease, however, such datasets contain many irrelevant or weak correlation features which influence the predictive accuracy of diagnosis. Without a feature selection algorithm, it is difficult for the existing classification techniques to accurately identify patterns in the features. The purpose of feature selection is to not only identify a feature subset from an original set of features [without reducing the predictive accuracy of classification algorithm] but also reduce the computation overhead in data mining. In this paper, we present our improved shuffled frog leaping algorithm which introduces a chaos memory weight factor, an absolute balance group strategy, and an adaptive transfer factor. Our proposed approach explores the space of possible subsets to obtain the set of features that maximizes the predictive accuracy and minimizes irrelevant features in high-dimensional biomedical data. To evaluate the effectiveness of our proposed method, we have employed the K-nearest neighbor method with a comparative analysis in which we compare our proposed approach with genetic algorithms, particle swarm optimization, and the shuffled frog leaping algorithm. Experimental results show that our improved algorithm achieves improvements in the identification of relevant subsets and in classification accuracy.

源语言英语
文章编号7551172
页(从-至)1765-1773
页数9
期刊IEEE/ACM Transactions on Computational Biology and Bioinformatics
15
6
DOI
出版状态已出版 - 1 11月 2018
已对外发布

指纹

探究 'Feature Selection for Optimized High-Dimensional Biomedical Data Using an Improved Shuffled Frog Leaping Algorithm' 的科研主题。它们共同构成独一无二的指纹。

引用此