An Empirical Comparison of Model-Agnostic Techniques for Defect Prediction Models
- Authors
- Lee,Gichan; Scott Uk-Jin Lee
- Issue Date
- Mar-2023
- Publisher
- IEEE
- Keywords
- Explainable Software Analytics; Software Quality Assurance; Defect Prediction Models; Model-Agnostic Technique
- Citation
- 2023 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER), pp 179 - 189
- Pages
- 11
- Indexed
- OTHER
- Journal Title
- 2023 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER)
- Start Page
- 179
- End Page
- 189
- URI
- https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/113331
- DOI
- 10.1109/SANER56733.2023.00026
- ISSN
- 0000-0000
- Abstract
- Recently, software defect prediction studies have attempted to make black-box defect prediction models explainable and actionable. State-of-the-art defect prediction studies have utilized various model-agnostic techniques derived from the explainable AI domain as key tools to make the predictions easier for practitioners to understand. However, it has not been sufficiently investigated whether there is inconsistent information within local explanations generated by different model-agnostic techniques when interpreting a defect prediction. If local explanations generated by heterogeneous model-agnostic techniques consist of different information, the derivable insights to understand and act upon defect predictions becomes less viable and it may cause ineffective or even incorrect defect corrections. In this research, we empirically analyzed 323,844 local explanations generated by three different model-agnostic techniques: (1) LIME, (2) SHAP, and (3) BreakDown. These local explanations were analyzed in terms of how the contributions of features were distributed, how the contributions were ranked, and whether the contributions were contradictory. We concluded that (i) different model-agnostic techniques provide practitioners with local explanations where average contributions of the top-ranked features are different; (ii) different model-agnostic techniques provide practitioners with local explanations consisting of different contribution rankings and inconsistent contribution directions of top-ranked features. Therefore, we recommend that practitioners should avoid using model-agnostic techniques interchangeably and must perform a multi-faceted manual validation when planning actions based on the local explanations.
- Files in This Item
-
Go to Link
- Appears in
Collections - COLLEGE OF COMPUTING > ERICA 컴퓨터학부 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.