Method for extracting valuable common structures from heterogeneous rooted and labeled tree data
- Authors
- Paik J.[Paik J.]; Nam J.[Nam J.]; Kim U.M.[Kim U.M.]; Won D.[Won D.]
- Issue Date
- 2014
- Keywords
- Label projection; Maximal frequent subtree; Pattern-growth approach; Subtree pattern; Tree mining
- Citation
- Journal of Information Science and Engineering, v.30, no.3, pp.787 - 817
- Indexed
- SCIE
SCOPUS
- Journal Title
- Journal of Information Science and Engineering
- Volume
- 30
- Number
- 3
- Start Page
- 787
- End Page
- 817
- URI
- https://scholarworks.bwise.kr/skku/handle/2021.sw.skku/57856
- ISSN
- 1016-2364
- Abstract
- The most commonly adopted approach to find valuable information from tree data is to extract frequently occurring subtree patterns. Because mining frequent tree patternshas a wide range of applications such as XML mining, web usage mining, bioinformatics, and network multicast routing, many algorithms have been recently proposed to find the patterns. However, existing tree mining algorithms suffer from several serious pitfalls in finding frequent tree patterns from massive tree datasets. Some of the major problems are due to (1) the computationally high cost of the candidate maintenance, (2) the repetitious input dataset scans, and (3) the high memory dependency. These problems stem from the fact that most of the algorithms are based on the well-known apriori algorithm and have used anti-monotone property for candidate generation and frequency counting in them. To solve the problems, we apply the pattern-growth approach instead of the apriori approach, and choose to extract maximal frequent subtree patterns rather than frequent subtree patterns. We present several new definitions and evaluate the effectiveness of the proposed algorithm.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - Computing and Informatics > Computer Science and Engineering > 1. Journal Articles
- Information and Communication Engineering > Department of Computer Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.