TY - JOUR
T1 - Establishing a software defect prediction model via effective dimension reduction
AU - Wei, Hua
AU - Hu, Changzhen
AU - Chen, Shiyou
AU - Xue, Yuan
AU - Zhang, Quanxin
N1 - Publisher Copyright:
© 2018
PY - 2019/3
Y1 - 2019/3
N2 - With the continued growth of interoperable software developed for Internet of Things (IoT), there is a growing demand to predict software defect at various testing and operational phases. This paper solves the software defect prediction problem by proposing a novel model with the help of a local tangent space alignment support vector machine (LTSA-SVM) algorithm. The model employs the SVM algorithm as the basic classifier of software defect distribution prediction model. Then, the model parameters are optimized by combining a grid search method and ten-fold cross validation. In the traditional dimensionality reduction algorithms, data loss caused by the poor attributes of data nonlinearity reduces the accuracy of SVM. Aiming at this problem, this paper uses a LTSA algorithm to extract the intrinsic structure of low-dimensional data and performs effective dimension reduction. The SVM algorithm is trained by the reduced dimension data. Finally, the feasibility of the prediction model is verified. Compared with the single SVM and the LLE-SVM prediction algorithm, the prediction model in this paper improves the prediction accuracy and F-measure by 1–4%.
AB - With the continued growth of interoperable software developed for Internet of Things (IoT), there is a growing demand to predict software defect at various testing and operational phases. This paper solves the software defect prediction problem by proposing a novel model with the help of a local tangent space alignment support vector machine (LTSA-SVM) algorithm. The model employs the SVM algorithm as the basic classifier of software defect distribution prediction model. Then, the model parameters are optimized by combining a grid search method and ten-fold cross validation. In the traditional dimensionality reduction algorithms, data loss caused by the poor attributes of data nonlinearity reduces the accuracy of SVM. Aiming at this problem, this paper uses a LTSA algorithm to extract the intrinsic structure of low-dimensional data and performs effective dimension reduction. The SVM algorithm is trained by the reduced dimension data. Finally, the feasibility of the prediction model is verified. Compared with the single SVM and the LLE-SVM prediction algorithm, the prediction model in this paper improves the prediction accuracy and F-measure by 1–4%.
KW - Dimensionality reduction
KW - Metric data program
KW - Software defect prediction
KW - Support vector machine
UR - http://www.scopus.com/inward/record.url?scp=85055880331&partnerID=8YFLogxK
U2 - 10.1016/j.ins.2018.10.056
DO - 10.1016/j.ins.2018.10.056
M3 - Article
AN - SCOPUS:85055880331
SN - 0020-0255
VL - 477
SP - 399
EP - 409
JO - Information Sciences
JF - Information Sciences
ER -