Investigating associative classification for software fault prediction: An experimental perspective

Baojun Ma, Huaping Zhang, Guoqing Chen*, Yanping Zhao, Bart Baesens

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

23 Citations (Scopus)

Abstract

It is a recurrent finding that software development is often troubled by considerable delays as well as budget overruns and several solutions have been proposed in answer to this observation, software fault prediction being a prime example. Drawing upon machine learning techniques, software fault prediction tries to identify upfront software modules that are most likely to contain faults, thereby streamlining testing efforts and improving overall software quality. When deploying fault prediction models in a production environment, both prediction performance and model comprehensibility are typically taken into consideration, although the latter is commonly overlooked in the academic literature. Many classification methods have been suggested to conduct fault prediction; yet associative classification methods remain uninvestigated in this context. This paper proposes an associative classification (AC)-based fault prediction method, building upon the CBA2 algorithm. In an empirical comparison on 12 real-world datasets, the AC-based classifier is shown to achieve a predictive performance competitive to those of models induced by five other tree/rule-based classification techniques. In addition, our findings also highlight the comprehensibility of the AC-based models, while achieving similar prediction performance. Furthermore, the possibilities of cross project prediction are investigated, strengthening earlier findings on the feasibility of such approach when insufficient data on the target project is available.

Original languageEnglish
Pages (from-to)61-90
Number of pages30
JournalInternational Journal of Software Engineering and Knowledge Engineering
Volume24
Issue number1
DOIs
Publication statusPublished - Feb 2014

Keywords

  • Software fault prediction
  • associative classification
  • comprehensibility
  • cross project validation
  • prediction performance

Fingerprint

Dive into the research topics of 'Investigating associative classification for software fault prediction: An experimental perspective'. Together they form a unique fingerprint.

Cite this