跳到主要导航 跳到搜索 跳到主要内容

Ensemble multiboost based on ripper classifier for prediction of imbalanced software defect data

  • Haitao He
  • , Xu Zhang
  • , Qian Wang
  • , Jiadong Ren
  • , Jiaxin Liu
  • , Xiaolin Zhao
  • , Yongqiang Cheng

科研成果: 期刊稿件文章同行评审

摘要

Identifying defective software entities is essential to ensure software quality during software development. However, the high dimensionality and class distribution imbalance of software defect data seriously affect software defect prediction performance. In order to solve this problem, this paper proposes an Ensemble MultiBoost based on RIPPER classifier for prediction of imbalanced Software Defect data, called EMR_SD. Firstly, the algorithm uses principal component analysis (PCA) method to find out the most effective features from the original features of the data set, so as to achieve the purpose of dimensionality reduction and redundancy removal. Furthermore, the combined sampling method of adaptive synthetic sampling (ADASYN) and random sampling without replacement is performed to solve the problem of data class imbalance. This classifier establishes association rules based on attributes and classes, using MultiBoost to reduce deviation and variance, so as to achieve the purpose of reducing classification error. The proposed prediction model is evaluated experimentally on the NASA MDP public datasets and compared with existing similar algorithms. The results show that EMR-SD algorithm is superior to DNC, CEL and other defect prediction techniques in most evaluation indicators, which proves the effectiveness of the algorithm.

源语言英语
文章编号2934128
页(从-至)110333-110343
页数11
期刊IEEE Access
7
DOI
出版状态已出版 - 2019

指纹

探究 'Ensemble multiboost based on ripper classifier for prediction of imbalanced software defect data' 的科研主题。它们共同构成独一无二的指纹。

引用此