Tree pattern expression for extracting information from syntactically parsed text corpora
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Choi, Yong Suk | - |
dc.date.accessioned | 2022-07-16T22:23:02Z | - |
dc.date.available | 2022-07-16T22:23:02Z | - |
dc.date.created | 2021-05-12 | - |
dc.date.issued | 2011-01 | - |
dc.identifier.issn | 1384-5810 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/169283 | - |
dc.description.abstract | With the public availability of a number of syntactically parsed text corpora, it has been increasingly important to efficiently extract desired information from such corpora. Many conventional works extract a desired text part by matching the parse tree of each sentence to a query that is represented as a structural form of relational predicates expressing a common structural pattern of desired text parts. However, although those works can be useful for limited types of simple queries, they are not very efficient in general because query formulations are sometimes very complicated for complex patterns of desired text parts and query matching tasks are likely to be exponentially time-consuming when considering a variety of complex sentential structures in a text corpus. In order to overcome such inadequacy, we present a novel tree pattern expression (TPE) that can represent various structural patterns intuitively and reduce pattern-matching complexity significantly. This paper first proposes TPE and its pattern-matching algorithm, and then theoretically analyzes the complexity of the proposed pattern-matching algorithm. It also illustrates a TPE-based information extraction system, which is applied to real text mining in a bio-text corpus. It finally shows some experimental results with some discussions in comparison with other systems. | - |
dc.language | 영어 | - |
dc.language.iso | en | - |
dc.publisher | SPRINGER | - |
dc.title | Tree pattern expression for extracting information from syntactically parsed text corpora | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Choi, Yong Suk | - |
dc.identifier.doi | 10.1007/s10618-010-0184-8 | - |
dc.identifier.scopusid | 2-s2.0-78651373156 | - |
dc.identifier.wosid | 000286001100007 | - |
dc.identifier.bibliographicCitation | DATA MINING AND KNOWLEDGE DISCOVERY, v.22, no.1-2, pp.211 - 231 | - |
dc.relation.isPartOf | DATA MINING AND KNOWLEDGE DISCOVERY | - |
dc.citation.title | DATA MINING AND KNOWLEDGE DISCOVERY | - |
dc.citation.volume | 22 | - |
dc.citation.number | 1-2 | - |
dc.citation.startPage | 211 | - |
dc.citation.endPage | 231 | - |
dc.type.rims | ART | - |
dc.type.docType | Article | - |
dc.description.journalClass | 1 | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
dc.subject.keywordPlus | Artificial intelligence | - |
dc.subject.keywordPlus | Data mining | - |
dc.subject.keywordPlus | Forestry | - |
dc.subject.keywordPlus | Information retrieval | - |
dc.subject.keywordPlus | Pattern matching | - |
dc.subject.keywordPlus | Text processing | - |
dc.subject.keywordAuthor | Tree pattern | - |
dc.subject.keywordAuthor | Information extraction | - |
dc.subject.keywordAuthor | Tree pattern-matching algorithm | - |
dc.identifier.url | https://link.springer.com/article/10.1007/s10618-010-0184-8 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1365
COPYRIGHT © 2021 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.