Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

A multi-head adjacent attention-based pyramid layered model for nested named entity recognition

Full metadata record
DC Field Value Language
dc.contributor.authorCui, Shengmin-
dc.contributor.authorJoe, Inwhee-
dc.date.accessioned2023-07-05T02:45:45Z-
dc.date.available2023-07-05T02:45:45Z-
dc.date.created2022-10-06-
dc.date.issued2023-01-
dc.identifier.issn0941-0643-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/186140-
dc.description.abstractNamed entity recognition (NER) is one of the widely studied natural language processing tasks in recent years. Conventional solutions treat the NER as a sequence labeling problem, but these approaches cannot handle nested NER. This is due to the fact that nested NER refers to the case where one entity contains another entity and it is not feasible to tag each token with a single tag. The pyramid model stacks L flat NER layers for prediction, which subtly enumerates all spans with length less than or equal to L. However, the original model introduces a block consisting of a convolutional layer and a bidirectional long short-term memory (Bi-LSTM) layer as the decoder, which does not consider the dependency between adjacent inputs and the Bi-LSTM cannot perform parallel computation on sequential inputs. For the purpose of improving performance and reducing the forward computation, we propose a Multi-Head Adjacent Attention-based Pyramid Layered model. In addition, when constructing a pyramid structure for span representation, the information of the intermediate words has more proportion than words on the two sides. To address this imbalance in the span representation, we fuse the output of the attention layer with the features of head and tail words when doing classification. We conducted experiments on nested NER datasets such as GENIA, SciERC, and ADE to validate the effectiveness of our proposed model.-
dc.language영어-
dc.language.isoen-
dc.publisherSPRINGER LONDON LTD-
dc.titleA multi-head adjacent attention-based pyramid layered model for nested named entity recognition-
dc.typeArticle-
dc.contributor.affiliatedAuthorJoe, Inwhee-
dc.identifier.doi10.1007/s00521-022-07747-8-
dc.identifier.scopusid2-s2.0-85137329817-
dc.identifier.wosid000849284000001-
dc.identifier.bibliographicCitationNEURAL COMPUTING & APPLICATIONS, v.35, no.3, pp.2561 - 5274-
dc.relation.isPartOfNEURAL COMPUTING & APPLICATIONS-
dc.citation.titleNEURAL COMPUTING & APPLICATIONS-
dc.citation.volume35-
dc.citation.number3-
dc.citation.startPage2561-
dc.citation.endPage5274-
dc.type.rimsART-
dc.type.docTypeArticle; Early Access-
dc.description.journalClass1-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.subject.keywordPlusEXTRACTION-
dc.subject.keywordAuthorNested named entity recognition-
dc.subject.keywordAuthorNamed entity recognition-
dc.subject.keywordAuthorAttention-
dc.subject.keywordAuthorPyramid-
dc.subject.keywordAuthorNatural language processing-
dc.identifier.urlhttps://link.springer.com/article/10.1007/s00521-022-07747-8-
Files in This Item
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE