基于鲸群优化随机森林算法的非平衡数据分类

Lizhu Ye, Donghua Zheng, Yuehong Liu, Shaohua Niu

科研成果: 期刊稿件文章同行评审

3 引用 (Scopus)

摘要

In order to improve the accuracy of unbalanced data classification, the random forest algorithm is used for data classification, and the whale optimization algorithm is adoped to optimize the key parameters of the random forest, thus the adaptability of the random forest algorithm to unbalanced data classification is enhanced. First, the unbalanced data classification model is developed based on the random forest. The classification difficulties caused by sample imbalance are effectively solved through multiple decision tree weak classifiers of the random forest. Second, the whale swarm optimization algorithm is deployed to optimize the weight of weak classifiers, and the average classification accuracy is taken as the fitness function of the whale swarm optimization. Thus the accuracy of the weak classifier weight voting on the final classification results. Finally, the random forest model optimized by the whale population is used to classify the unbalanced data. Experiments show that by reasonably setting the parameters of the whale swarm optimization algorithm, the weight of random forest weak classifiers with higher classification accuracy can be obtained. Compared with the unbalanced data classification algorithms, this algorithm can obtain better classification performance.

投稿的翻译标题Unbalanced data classification based on whale swarm optimization random forest algorithm
源语言繁体中文
页(从-至)99-105
页数7
期刊Nanjing Youdian Daxue Xuebao (Ziran Kexue Ban)/Journal of Nanjing University of Posts and Telecommunications (Natural Science)
42
6
DOI
出版状态已出版 - 12月 2022

关键词

  • decision tree
  • random forest
  • unbalanced data classification
  • weak classifier
  • whale swarm optimization algorithm

指纹

探究 '基于鲸群优化随机森林算法的非平衡数据分类' 的科研主题。它们共同构成独一无二的指纹。

引用此