Software Defect Prediction Using Ensemble Learning: A Systematic Literature Review

Matloob, Faseeha; Ghazal, Taher M.; Taleb, Nasser; Aftab, Shabib; Ahmad, Munir; Khan, Muhammad Adnan; Abbas, Sagheer; Soomro, Tariq Rahim

Detailed Information

Cited 31 time in webofscience

Cited 54 time in scopus

Metadata Downloads

Software Defect Prediction Using Ensemble Learning: A Systematic Literature Review

Authors: Matloob, Faseeha; Ghazal, Taher M.; Taleb, Nasser; Aftab, Shabib; Ahmad, Munir; Khan, Muhammad Adnan; Abbas, Sagheer; Soomro, Tariq Rahim

Issue Date: Jul-2021

Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords: Software; Systematics; Data mining; Tools; Predictive models; Machine learning algorithms; Bibliographies; Systematic literature review (SLR); ensemble classifier; hybrid classifier; software defect prediction

Citation: IEEE ACCESS, v.9, pp.98754 - 98771

Journal Title: IEEE ACCESS

Volume: 9

Start Page: 98754

End Page: 98771

URI: https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/81829

DOI: 10.1109/ACCESS.2021.3095559

ISSN: 2169-3536

Abstract: Recent advances in the domain of software defect prediction (SDP) include the integration of multiple classification techniques to create an ensemble or hybrid approach. This technique was introduced to improve the prediction performance by overcoming the limitations of any single classification technique. This research provides a systematic literature review on the use of the ensemble learning approach for software defect prediction. The review is conducted after critically analyzing research papers published since 2012 in four well-known online libraries: ACM, IEEE, Springer Link, and Science Direct. In this study, five research questions covering the different aspects of research progress on the use of ensemble learning for software defect prediction are addressed. To extract the answers to identified questions, 46 most relevant papers are shortlisted after a thorough systematic research process. This study will provide compact information regarding the latest trends and advances in ensemble learning for software defect prediction and provide a baseline for future innovations and further reviews. Through our study, we discovered that frequently employed ensemble methods by researchers are the random forest, boosting, and bagging. Less frequently employed methods include stacking, voting and Extra Trees. Researchers proposed many promising frameworks, such as EMKCA, SMOTE-Ensemble, MKEL, SDAEsTSE, TLEL, and LRCR, using ensemble learning methods. The AUC, accuracy, F-measure, Recall, Precision, and MCC were mostly utilized to measure the prediction performance of models. WEKA was widely adopted as a platform for machine learning. Many researchers showed through empirical analysis that features selection, and data sampling was necessary pre-processing steps that improve the performance of ensemble classifiers.

Files in This Item: There are no files associated with this item.

Appears in Collections: ETC > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Khan, Muhammad Adnan photo

Khan, Muhammad Adnan: College of IT Convergence (Department of Software)

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :4,228,871; Today View :638

RSS_1.0 RSS_2.0 ATOM_1.0

1342, Seongnam-daero, Sujeong-gu, Seongnam-si, Gyeonggi-do, Republic of Korea(13120)031-750-5114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE