Show simple item record

dc.contributor.author	Bowes, David
dc.contributor.author	Hall, Tracy
dc.contributor.author	Petric, Jean
dc.date.accessioned	2018-08-16T00:11:46Z
dc.date.available	2018-08-16T00:11:46Z
dc.date.issued	2017-02-07
dc.identifier.citation	Bowes, D., Hall, T. & Petric, J. 2017, 'Software defect prediction: do different classifiers find the same defects?', Software Quality Journal, vol. 26, pp. 525–552. https://doi.org/10.1007/s11219-016-9353-3
dc.identifier.issn	0963-9314
dc.identifier.other	PURE: 11268994
dc.identifier.other	PURE UUID: cbf3df08-13d6-42cf-89e1-77b2996775f0
dc.identifier.other	Scopus: 85011708666
dc.identifier.uri	http://hdl.handle.net/2299/20367
dc.description	Open Access: This article is distributed under the terms of the Creative Commons Attribution 4.0 International License CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
dc.description.abstract	During the last 10 years, hundreds of different defect prediction models have been published. The performance of the classifiers used in these models is reported to be similar, with models rarely performing above the predictive performance ceiling of about 80% recall. We investigate the individual defects that four classifiers predict and analyse the level of prediction uncertainty produced by these classifiers. We perform a sensitivity analysis to compare the performance of Random Forest, Naïve Bayes, RPart and SVM classifiers when predicting defects in NASA, open source and commercial datasets. The defect predictions that each classifier makes are captured in a confusion matrix, and the prediction uncertainty of each classifier is compared. Despite similar predictive performance values for these four classifiers, each detects different sets of defects. Some classifiers are more consistent in predicting defects than others. Our results confirm that a unique subset of defects can be detected by specific classifiers. However, while some classifiers are consistent in the predictions they make, other classifiers vary in their predictions. Given our results, we conclude that classifier ensembles with decision-making strategies not based on majority voting are likely to perform best in defect prediction.
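The abstract's core observation can be illustrated with a minimal sketch: two classifiers with identical recall may still flag different individual defects, which is why majority voting over such classifiers can discard correct minority predictions. The labels and predictions below are invented toy data, not values from the study, and the classifier names in the comments are purely indicative.

```python
# Toy illustration (not data from the paper): two classifiers with equal
# recall that nevertheless detect different subsets of defective modules.

def recall(y_true, y_pred):
    """True-positive rate over the defective (label 1) modules."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp / (tp + fn)

def defects_found(y_true, y_pred):
    """Indices of defective modules the classifier correctly flags."""
    return {i for i, (t, p) in enumerate(zip(y_true, y_pred)) if t == 1 and p == 1}

y_true = [1, 1, 1, 1, 0, 0, 0, 0]   # 4 defective, 4 clean modules
pred_a = [1, 1, 0, 1, 0, 1, 0, 0]   # hypothetical classifier A
pred_b = [0, 1, 1, 1, 1, 0, 0, 0]   # hypothetical classifier B

found_a, found_b = defects_found(y_true, pred_a), defects_found(y_true, pred_b)
print(recall(y_true, pred_a), recall(y_true, pred_b))  # both 0.75
print(found_a - found_b)  # defect only A finds: {0}
print(found_b - found_a)  # defect only B finds: {2}
```

Under majority voting over such classifiers, defects found by only one member are lost; this is the kind of per-defect disagreement that motivates the paper's conclusion about non-majority-voting ensemble strategies.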
dc.format.extent	28
dc.language.iso	eng
dc.relation.ispartof	Software Quality Journal
dc.rights	Open
dc.subject	software defect prediction
dc.subject	prediction modelling
dc.subject	machine learning
dc.title	Software defect prediction: do different classifiers find the same defects?
dc.contributor.institution	School of Computer Science
dc.contributor.institution	Centre for Computer Science and Informatics Research
dc.description.status	Peer reviewed
dc.description.versiontype	Final Published version
dcterms.dateAccepted	2017-02-07
rioxxterms.version	VoR
rioxxterms.versionofrecord	https://doi.org/10.1007/s11219-016-9353-3
rioxxterms.licenseref.uri	http://creativecommons.org/licenses/by/4.0/
rioxxterms.type	Journal Article/Review
herts.preservation.rarelyaccessed	true
herts.rights.accesstype	Open

