基于鲸群优化随机森林算法的非平衡数据分类

Translated title of the contribution: Unbalanced data classification based on whale swarm optimization random forest algorithm

Lizhu Ye, Donghua Zheng, Yuehong Liu, Shaohua Niu

Research output: Contribution to journal › Article › peer-review

3 Citations (Scopus)

Abstract

To improve the accuracy of unbalanced data classification, the random forest algorithm is used as the classifier, and the whale optimization algorithm is adopted to optimize the key parameters of the random forest, which enhances the adaptability of the random forest algorithm to unbalanced data classification. First, the unbalanced data classification model is built on the random forest; the classification difficulties caused by sample imbalance are effectively addressed by the multiple decision-tree weak classifiers of the random forest. Second, the whale swarm optimization algorithm is used to optimize the weights of the weak classifiers, with the average classification accuracy taken as the fitness function, thereby improving the accuracy of the weighted vote of the weak classifiers on the final classification result. Finally, the random forest model optimized by the whale swarm is used to classify the unbalanced data. Experiments show that, with reasonably chosen parameters for the whale swarm optimization algorithm, weak-classifier weights that yield higher classification accuracy can be obtained. Compared with other unbalanced data classification algorithms, the proposed algorithm achieves better classification performance.
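The abstract gives no implementation details, so the following is only a minimal sketch of the idea it describes: per-tree weights combined by weighted voting, tuned by a whale-optimization search whose fitness is an average classification accuracy. Several things here are assumptions rather than the authors' method: the function names (`weighted_vote`, `mean_class_accuracy`, `woa_optimize`) are hypothetical, the fitness is interpreted as mean per-class accuracy, the update rules follow the commonly published form of the whale optimization algorithm with scalar coefficients per whale, and the tree predictions are hard-coded toy values instead of the output of a trained forest.

```python
import math
import random

def weighted_vote(tree_preds, weights, n_classes):
    """tree_preds[t][i] is the class that tree t predicts for sample i."""
    result = []
    for i in range(len(tree_preds[0])):
        scores = [0.0] * n_classes
        for t, w in enumerate(weights):
            scores[tree_preds[t][i]] += w
        # Ties are resolved in favour of the lowest class index.
        result.append(max(range(n_classes), key=scores.__getitem__))
    return result

def mean_class_accuracy(y_true, y_pred, n_classes):
    """Average of per-class accuracies (assumed reading of the fitness)."""
    per_class = []
    for c in range(n_classes):
        idx = [i for i, y in enumerate(y_true) if y == c]
        if idx:
            per_class.append(sum(y_pred[i] == c for i in idx) / len(idx))
    return sum(per_class) / len(per_class)

def woa_optimize(fitness, dim, n_whales=20, n_iter=50, b=1.0, seed=0):
    """Simplified whale optimization maximizing `fitness` over [0, 1]^dim."""
    rng = random.Random(seed)
    # The first whale starts at uniform weights, so the returned fitness is
    # never below that of plain majority voting on the same data.
    whales = [[1.0] * dim]
    whales += [[rng.random() for _ in range(dim)] for _ in range(n_whales - 1)]
    fits = [fitness(w) for w in whales]
    k = max(range(n_whales), key=fits.__getitem__)
    best, best_fit = whales[k][:], fits[k]
    for it in range(n_iter):
        a = 2.0 * (1 - it / n_iter)          # decreases linearly from 2 toward 0
        for j in range(n_whales):
            x = whales[j]
            A = 2 * a * rng.random() - a
            C = 2 * rng.random()
            if rng.random() < 0.5:
                # Encircle the best whale if |A| < 1, else explore around a random one.
                ref = best if abs(A) < 1 else whales[rng.randrange(n_whales)]
                new = [ref[d] - A * abs(C * ref[d] - x[d]) for d in range(dim)]
            else:
                # Spiral move toward the best whale.
                l = rng.uniform(-1, 1)
                spiral = math.exp(b * l) * math.cos(2 * math.pi * l)
                new = [abs(best[d] - x[d]) * spiral + best[d] for d in range(dim)]
            whales[j] = [min(1.0, max(0.0, v)) for v in new]  # keep weights in [0, 1]
            f = fitness(whales[j])
            if f > best_fit:
                best, best_fit = whales[j][:], f
    return best, best_fit

# Toy imbalanced data: 8 samples of class 0, 2 samples of class 1.
y_true = [0] * 8 + [1] * 2
tree_preds = [
    y_true[:],   # tree 0 happens to be correct on every sample
    [0] * 10,    # trees 1 and 2 always predict the majority class
    [0] * 10,
]
fitness = lambda w: mean_class_accuracy(y_true, weighted_vote(tree_preds, w, 2), 2)
baseline = fitness([1.0, 1.0, 1.0])            # unweighted majority vote
best_w, best_fit = woa_optimize(fitness, dim=3)
print("uniform:", baseline, "optimized:", best_fit, "weights:", best_w)
```

In this toy setting an unweighted vote is dominated by the two majority-biased trees, so the search only helps if it finds weights where the first tree outvotes the other two. In practice the fitness would presumably be evaluated on held-out data rather than the training set, to avoid fitting the weights to noise.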

Original language: Chinese (Traditional)
Pages (from-to): 99-105
Number of pages: 7
Journal: Nanjing Youdian Daxue Xuebao (Ziran Kexue Ban)/Journal of Nanjing University of Posts and Telecommunications (Natural Science)
Volume: 42
Issue number: 6
Publication status: Published - Dec 2022
