Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Adaptive Named Entity Recognition Using Distant Supervision for Contemporary Written Texts

Full metadata record
DC Field Value Language
dc.contributor.authorKim, Juae-
dc.contributor.authorKim, Yejin-
dc.contributor.authorKang, Sangwoo-
dc.contributor.authorSeo, Jungyun-
dc.date.accessioned2021-08-05T01:40:42Z-
dc.date.available2021-08-05T01:40:42Z-
dc.date.created2021-04-05-
dc.date.issued2021-03-
dc.identifier.issn2169-3536-
dc.identifier.urihttps://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/81826-
dc.description.abstractNamed entity recognition (NER) is the process of categorizing named entities in a given text that suffers from the lack of labeled corpora, which is a long-standing issue. Deep neural networks have been successfully applied to NER tasks. However, they require a large number of annotated data. Regardless of the number of data made available, annotation requires significant human effort, which is expensive and time-consuming. Moreover, collecting labeled data that reflect contemporary surrounding statuses requires exhaustive follow-up and incurs correspondingly higher costs. Current NERs typically focus on the supervised learning of hand-crafted data. The most well-known dataset for NER shared tasks, which was released at the 2003 Conference on Natural Language Learning, is used for basic training and evaluation. Although the data are qualified, the database has low coverage of timely material. In this paper, we illustrate methods for swiftly labeling up-to-date data via distant supervision. To tackle the difficulty of annotating contemporary written texts, we generate labeled data articles that reflect the latest issues. We evaluated the proposed methods with bidirectional long short-term memory conditional random-field architecture using static and contextualized embedding methods. Our proposed models perform higher than state-of-the-art methods with average F1-scores 3.09% better with weakly labeledWikipedia data and 3.47% better with Cable News Network data. When using the NER model with Flair embedding, our method shows 1.50 and 3.26% higher F1-scores with weakly labeled Wikipedia and news data, respectively. Qualitatively, the proposed model also performs better when extracting contemporary keywords. CCBYNCND-
dc.language영어-
dc.language.isoen-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.relation.isPartOfIEEE Access-
dc.titleAdaptive Named Entity Recognition Using Distant Supervision for Contemporary Written Texts-
dc.typeArticle-
dc.type.rimsART-
dc.description.journalClass1-
dc.identifier.wosid000673954500001-
dc.identifier.doi10.1109/ACCESS.2021.3067315-
dc.identifier.bibliographicCitationIEEE Access, v.9, pp.80405 - 80414-
dc.description.isOpenAccessN-
dc.identifier.scopusid2-s2.0-85103297425-
dc.citation.endPage80414-
dc.citation.startPage80405-
dc.citation.titleIEEE Access-
dc.citation.volume9-
dc.contributor.affiliatedAuthorKang, Sangwoo-
dc.type.docTypeArticle in Press-
dc.subject.keywordAuthorComputational and artificial intelligence-
dc.subject.keywordAuthorElectronic publishing-
dc.subject.keywordAuthorEncyclopedias-
dc.subject.keywordAuthorInformation services-
dc.subject.keywordAuthorInternet-
dc.subject.keywordAuthornamed entity recognition-
dc.subject.keywordAuthornatural language processing-
dc.subject.keywordAuthorneural networks-
dc.subject.keywordAuthorTask analysis-
dc.subject.keywordAuthorTraining-
dc.subject.keywordAuthorTransfer learning-
dc.subject.keywordAuthortransfer learning-
dc.subject.keywordAuthorweakly supervised learning-
dc.subject.keywordPlusCharacter recognition-
dc.subject.keywordPlusDeep learning-
dc.subject.keywordPlusDeep neural networks-
dc.subject.keywordPlusEmbeddings-
dc.subject.keywordPlusInformation dissemination-
dc.subject.keywordPlusLabeled data-
dc.subject.keywordPlusPetroleum reservoir evaluation-
dc.subject.keywordPlusBasic training-
dc.subject.keywordPlusConditional random field-
dc.subject.keywordPlusEmbedding method-
dc.subject.keywordPlusNamed entities-
dc.subject.keywordPlusNamed entity recognition-
dc.subject.keywordPlusNatural language learning-
dc.subject.keywordPlusNumber of datum-
dc.subject.keywordPlusState-of-the-art methods-
dc.subject.keywordPlusNatural language processing systems-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
Files in This Item
There are no files associated with this item.
Appears in
Collections
IT융합대학 > 소프트웨어학과 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kang, Sang Woo photo

Kang, Sang Woo
College of IT Convergence (Department of Software)
Read more

Altmetrics

Total Views & Downloads

BROWSE