An Empirical Comparison of Model-Agnostic Techniques for Defect Prediction Models
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lee,Gichan | - |
dc.contributor.author | Scott Uk-Jin Lee | - |
dc.date.accessioned | 2023-07-05T05:46:37Z | - |
dc.date.available | 2023-07-05T05:46:37Z | - |
dc.date.issued | 2023-03 | - |
dc.identifier.issn | 0000-0000 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/113331 | - |
dc.description.abstract | Recently, software defect prediction studies have attempted to make black-box defect prediction models explainable and actionable. State-of-the-art defect prediction studies have utilized various model-agnostic techniques derived from the explainable AI domain as key tools to make the predictions easier for practitioners to understand. However, it has not been sufficiently investigated whether there is inconsistent information within local explanations generated by different model-agnostic techniques when interpreting a defect prediction. If local explanations generated by heterogeneous model-agnostic techniques consist of different information, the derivable insights to understand and act upon defect predictions becomes less viable and it may cause ineffective or even incorrect defect corrections. In this research, we empirically analyzed 323,844 local explanations generated by three different model-agnostic techniques: (1) LIME, (2) SHAP, and (3) BreakDown. These local explanations were analyzed in terms of how the contributions of features were distributed, how the contributions were ranked, and whether the contributions were contradictory. We concluded that (i) different model-agnostic techniques provide practitioners with local explanations where average contributions of the top-ranked features are different; (ii) different model-agnostic techniques provide practitioners with local explanations consisting of different contribution rankings and inconsistent contribution directions of top-ranked features. Therefore, we recommend that practitioners should avoid using model-agnostic techniques interchangeably and must perform a multi-faceted manual validation when planning actions based on the local explanations. | - |
dc.format.extent | 11 | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | IEEE | - |
dc.title | An Empirical Comparison of Model-Agnostic Techniques for Defect Prediction Models | - |
dc.type | Article | - |
dc.publisher.location | 미국 | - |
dc.identifier.doi | 10.1109/SANER56733.2023.00026 | - |
dc.identifier.scopusid | 2-s2.0-85156855563 | - |
dc.identifier.wosid | 001008282200016 | - |
dc.identifier.bibliographicCitation | 2023 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER), pp 179 - 189 | - |
dc.citation.title | 2023 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER) | - |
dc.citation.startPage | 179 | - |
dc.citation.endPage | 189 | - |
dc.type.docType | Proceedings Paper | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | other | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Software Engineering | - |
dc.subject.keywordAuthor | Explainable Software Analytics | - |
dc.subject.keywordAuthor | Software Quality Assurance | - |
dc.subject.keywordAuthor | Defect Prediction Models | - |
dc.subject.keywordAuthor | Model-Agnostic Technique | - |
dc.identifier.url | https://ieeexplore.ieee.org/document/10123486?arnumber=10123486&SID=EBSCO:edseee | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr
COPYRIGHT © 2021 HANYANG UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.