A systematic review of software fault prediction studies

Catal C., Diri B.

EXPERT SYSTEMS WITH APPLICATIONS, vol.36, pp.7346-7354, 2009 (SCI-Expanded)

  • Publication Type: Article / Review
  • Volume: 36
  • Publication Date: 2009
  • Doi Number: 10.1016/j.eswa.2008.10.027
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Page Numbers: pp.7346-7354
  • Yıldız Technical University Affiliated: Yes


This paper provides a systematic review of previous software fault prediction studies with a specific focus on metrics, methods, and datasets. The review covers 74 software fault prediction papers from 11 journals and several conference proceedings. According to the review results, the usage percentage of public datasets has increased significantly and the usage percentage of machine learning algorithms has increased slightly since 2005. In addition, method-level metrics are still the most dominant metrics in the fault prediction research area, and machine learning algorithms are still the most popular methods for fault prediction. Researchers working in the software fault prediction area should continue to use public datasets and machine learning algorithms to build better fault predictors. The usage percentage of class-level metrics is below acceptable levels, and they should be used much more than they are now in order to predict faults earlier, in the design phase of the software life cycle. © 2008 Elsevier Ltd. All rights reserved.