Tree-Pattern-Based Clone Detection with High Precision and Recall
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lee, Hyo-Sub | - |
dc.contributor.author | Choi, Myung-Ryul | - |
dc.contributor.author | Doh, Kyung-Goo | - |
dc.date.accessioned | 2021-06-22T12:01:19Z | - |
dc.date.available | 2021-06-22T12:01:19Z | - |
dc.date.created | 2021-01-21 | - |
dc.date.issued | 2018-05 | - |
dc.identifier.issn | 1976-7277 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/6216 | - |
dc.description.abstract | The paper proposes a code-clone detection method that gives the highest possible precision and recall, without giving much attention to efficiency and scalability. The goal is to automatically create a reliable reference corpus that can be used as a basis for evaluating the precision and recall of clone detection tools. The algorithm takes an abstract-syntax-tree representation of source code and thoroughly examines every possible pair of all duplicate tree patterns in the tree, while avoiding unnecessary and duplicated comparisons wherever possible. The largest possible duplicate patterns are then collected in the set of pattern clusters that are used to identify code clones. The method is implemented and evaluated for a standard set of open-source Java applications. The experimental result shows very high precision and recall. False-negative clones missed by our method are all non-contiguous clones. Finally, the concept of neighbor patterns, which can be used to improve recall by detecting non-contiguous clones and intertwined clones, is proposed. | - |
dc.language | 영어 | - |
dc.language.iso | en | - |
dc.publisher | 한국인터넷정보학회 | - |
dc.title | Tree-Pattern-Based Clone Detection with High Precision and Recall | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Choi, Myung-Ryul | - |
dc.contributor.affiliatedAuthor | Doh, Kyung-Goo | - |
dc.identifier.doi | 10.3837/tiis.2018.05.002 | - |
dc.identifier.scopusid | 2-s2.0-85047933470 | - |
dc.identifier.wosid | 000434019100002 | - |
dc.identifier.bibliographicCitation | KSII Transactions on Internet and Information Systems, v.12, no.5, pp.1932 - 1950 | - |
dc.relation.isPartOf | KSII Transactions on Internet and Information Systems | - |
dc.citation.title | KSII Transactions on Internet and Information Systems | - |
dc.citation.volume | 12 | - |
dc.citation.number | 5 | - |
dc.citation.startPage | 1932 | - |
dc.citation.endPage | 1950 | - |
dc.type.rims | ART | - |
dc.type.docType | Article | - |
dc.identifier.kciid | ART002351947 | - |
dc.description.journalClass | 1 | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.description.journalRegisteredClass | kci | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalResearchArea | Telecommunications | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
dc.relation.journalWebOfScienceCategory | Telecommunications | - |
dc.subject.keywordPlus | CODE | - |
dc.subject.keywordAuthor | Software maintenance | - |
dc.subject.keywordAuthor | code clone | - |
dc.subject.keywordAuthor | clone detection | - |
dc.subject.keywordAuthor | abstract syntax tree | - |
dc.identifier.url | http://itiis.org/digital-library/manuscript/2000 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr
COPYRIGHT © 2021 HANYANG UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.