Cited 0 time in
A multi-head adjacent attention-based pyramid layered model for nested named entity recognition
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Cui, Shengmin | - |
| dc.contributor.author | Joe, Inwhee | - |
| dc.date.accessioned | 2023-07-05T02:45:45Z | - |
| dc.date.available | 2023-07-05T02:45:45Z | - |
| dc.date.created | 2022-10-06 | - |
| dc.date.issued | 2023-01 | - |
| dc.identifier.issn | 0941-0643 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/186140 | - |
| dc.description.abstract | Named entity recognition (NER) is one of the widely studied natural language processing tasks in recent years. Conventional solutions treat the NER as a sequence labeling problem, but these approaches cannot handle nested NER. This is due to the fact that nested NER refers to the case where one entity contains another entity and it is not feasible to tag each token with a single tag. The pyramid model stacks L flat NER layers for prediction, which subtly enumerates all spans with length less than or equal to L. However, the original model introduces a block consisting of a convolutional layer and a bidirectional long short-term memory (Bi-LSTM) layer as the decoder, which does not consider the dependency between adjacent inputs and the Bi-LSTM cannot perform parallel computation on sequential inputs. For the purpose of improving performance and reducing the forward computation, we propose a Multi-Head Adjacent Attention-based Pyramid Layered model. In addition, when constructing a pyramid structure for span representation, the information of the intermediate words has more proportion than words on the two sides. To address this imbalance in the span representation, we fuse the output of the attention layer with the features of head and tail words when doing classification. We conducted experiments on nested NER datasets such as GENIA, SciERC, and ADE to validate the effectiveness of our proposed model. | - |
| dc.language | 영어 | - |
| dc.language.iso | en | - |
| dc.publisher | SPRINGER LONDON LTD | - |
| dc.title | A multi-head adjacent attention-based pyramid layered model for nested named entity recognition | - |
| dc.type | Article | - |
| dc.contributor.affiliatedAuthor | Joe, Inwhee | - |
| dc.identifier.doi | 10.1007/s00521-022-07747-8 | - |
| dc.identifier.scopusid | 2-s2.0-85137329817 | - |
| dc.identifier.wosid | 000849284000001 | - |
| dc.identifier.bibliographicCitation | NEURAL COMPUTING & APPLICATIONS, v.35, no.3, pp.2561 - 5274 | - |
| dc.relation.isPartOf | NEURAL COMPUTING & APPLICATIONS | - |
| dc.citation.title | NEURAL COMPUTING & APPLICATIONS | - |
| dc.citation.volume | 35 | - |
| dc.citation.number | 3 | - |
| dc.citation.startPage | 2561 | - |
| dc.citation.endPage | 5274 | - |
| dc.type.rims | ART | - |
| dc.type.docType | Article; Early Access | - |
| dc.description.journalClass | 1 | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
| dc.subject.keywordPlus | EXTRACTION | - |
| dc.subject.keywordAuthor | Nested named entity recognition | - |
| dc.subject.keywordAuthor | Named entity recognition | - |
| dc.subject.keywordAuthor | Attention | - |
| dc.subject.keywordAuthor | Pyramid | - |
| dc.subject.keywordAuthor | Natural language processing | - |
| dc.identifier.url | https://link.springer.com/article/10.1007/s00521-022-07747-8 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
