Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

딥러닝 기반 한국어 개체명 인식의 평가와 오류 분석 연구

Full metadata record
DC Field Value Language
dc.contributor.author유현조-
dc.contributor.author송영숙-
dc.contributor.author김민수-
dc.contributor.author윤기현-
dc.contributor.author정유남-
dc.date.accessioned2023-03-08T11:57:12Z-
dc.date.available2023-03-08T11:57:12Z-
dc.date.issued2021-
dc.identifier.issn1229-4039-
dc.identifier.issn2734-0481-
dc.identifier.urihttps://scholarworks.bwise.kr/cau/handle/2019.sw.cau/62795-
dc.description.abstractNamed entity recognition is a natural language processing task that recognizes and classifies named entities in an unstructured text. The targets of NER are not limited to typical proper names for persons, locations and organizations, but also date, time and quantity expressions and can be further expanded to names of events, animals, plants, materials and other encyclopedic entities. A real-world NER system is also expected to be tuned to process domain-specific terminologies. In this study, the researchers built and tested a BERT based Korean NER system and proposed methods for evaluation and error analysis. The study trained the system with 140K word NER corpus and evaluated with 60K test. Error types are proposed to be categorized into four classes: detection, boundary, segmentation, and labelling. Error rates are found to vary greatly from 1% to 30% between entity labels, which are grouped into the most accurate time and quantity expressions, relatively accurate proper names, and highly erroneous terminologies. We expect that the error analysis will provide insights for finding a better way of data collection and post-processing correction.-
dc.format.extent26-
dc.language한국어-
dc.language.isoKOR-
dc.publisher한국언어학회-
dc.title딥러닝 기반 한국어 개체명 인식의 평가와 오류 분석 연구-
dc.title.alternativeError Analysis and Evaluation of Deep-learning Based Korean Named Entity Recognition-
dc.typeArticle-
dc.identifier.doi10.18855/lisoko.2021.46.3.010-
dc.identifier.bibliographicCitation언어, v.46, no.3, pp 803 - 828-
dc.identifier.kciidART002760413-
dc.description.isOpenAccessN-
dc.citation.endPage828-
dc.citation.number3-
dc.citation.startPage803-
dc.citation.title언어-
dc.citation.volume46-
dc.publisher.location대한민국-
dc.subject.keywordAuthornamed entity recognition-
dc.subject.keywordAuthorKorean language-
dc.subject.keywordAuthornatural language processing-
dc.subject.keywordAuthorproper name-
dc.subject.keywordAuthorterminology-
dc.description.journalRegisteredClasskci-
Files in This Item
There are no files associated with this item.
Appears in
Collections
The Office of Research Affairs > Affiliated Research Institute > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE