Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

A multi-head adjacent attention-based pyramid layered model for nested named entity recognitionopen access

Authors
Cui, ShengminJoe, Inwhee
Issue Date
Jan-2023
Publisher
SPRINGER LONDON LTD
Keywords
Nested named entity recognition; Named entity recognition; Attention; Pyramid; Natural language processing
Citation
NEURAL COMPUTING & APPLICATIONS, v.35, no.3, pp.2561 - 5274
Indexed
SCIE
SCOPUS
Journal Title
NEURAL COMPUTING & APPLICATIONS
Volume
35
Number
3
Start Page
2561
End Page
5274
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/186140
DOI
10.1007/s00521-022-07747-8
ISSN
0941-0643
Abstract
Named entity recognition (NER) is one of the widely studied natural language processing tasks in recent years. Conventional solutions treat the NER as a sequence labeling problem, but these approaches cannot handle nested NER. This is due to the fact that nested NER refers to the case where one entity contains another entity and it is not feasible to tag each token with a single tag. The pyramid model stacks L flat NER layers for prediction, which subtly enumerates all spans with length less than or equal to L. However, the original model introduces a block consisting of a convolutional layer and a bidirectional long short-term memory (Bi-LSTM) layer as the decoder, which does not consider the dependency between adjacent inputs and the Bi-LSTM cannot perform parallel computation on sequential inputs. For the purpose of improving performance and reducing the forward computation, we propose a Multi-Head Adjacent Attention-based Pyramid Layered model. In addition, when constructing a pyramid structure for span representation, the information of the intermediate words has more proportion than words on the two sides. To address this imbalance in the span representation, we fuse the output of the attention layer with the features of head and tail words when doing classification. We conducted experiments on nested NER datasets such as GENIA, SciERC, and ADE to validate the effectiveness of our proposed model.
Files in This Item
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Joe, Inwhee photo

Joe, Inwhee
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)
Read more

Altmetrics

Total Views & Downloads

BROWSE