TPEMatcher: A tool for searching in parsed text corpora
- Authors
- Choi, Yong Suk
- Issue Date
- Dec-2011
- Publisher
- ELSEVIER
- Keywords
- Corpus search tool; Tree pattern querying; Tree pattern matching; Parsed text corpora; Text mining
- Citation
- KNOWLEDGE-BASED SYSTEMS, v.24, no.8, pp.1139 - 1150
- Indexed
- SCIE
SCOPUS
- Journal Title
- KNOWLEDGE-BASED SYSTEMS
- Volume
- 24
- Number
- 8
- Start Page
- 1139
- End Page
- 1150
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/166929
- DOI
- 10.1016/j.knosys.2011.04.009
- ISSN
- 0950-7051
- Abstract
- Recently, due to the widespread on-line availability of syntactically annotated text corpora, some automated tools for searching in such text corpora have gained great attention. Generally, those conventional corpus search tools use a decomposition-matching-merging method based on relational predicates for matching a tree pattern query to the desired parts of text corpora. Thus, their query formulation and expressivity are often complicated due to poorly understood query formalisms, and their searching tasks may require a big computational overhead due to a large number of repeated trials of matching tree patterns. To overcome these difficulties, we present TPEMatcher, a tool for searching in parsed text corpora. TPEMatcher provides not only an efficient way of query formulation and searching but also a good query expressivity based on concise syntax and semantics of tree pattern query. We also demonstrate that TPEMatcher can be effectively used for a text mining in practice with its useful interface providing in-depth details of search results.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.