Weakly labeled data augmentation for social media named entity recognition
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Kim, Juae | - |
dc.contributor.author | Kim, Yejin | - |
dc.contributor.author | Kang, Sangwoo | - |
dc.date.accessioned | 2022-10-27T09:40:04Z | - |
dc.date.available | 2022-10-27T09:40:04Z | - |
dc.date.created | 2022-10-27 | - |
dc.date.issued | 2022-12 | - |
dc.identifier.issn | 0957-4174 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/85854 | - |
dc.description.abstract | Named entity recognition is a task that extracts entities corresponding to predefined categories. Although NER is important in processing user-generated texts such as those obtained from social media, it remains challenging because such texts tend to contain numerous unseen words or abbreviations. To address this issue, we propose two methods for weakly labeled data generation that can extract named entities from social media texts more effectively: alias augmentation and typo augmentation. Using these methods, weakly labeled data are generated through the automatic annotation of unlabeled Wikipedia texts and Tweets and then trained through transfer learning. Our experimental results suggest that the proposed approach improves NER performance, with our best F1-score of 51.43% representing the highest score ever reported. | - |
dc.language | 영어 | - |
dc.language.iso | en | - |
dc.publisher | PERGAMON-ELSEVIER SCIENCE LTD | - |
dc.relation.isPartOf | EXPERT SYSTEMS WITH APPLICATIONS | - |
dc.title | Weakly labeled data augmentation for social media named entity recognition | - |
dc.type | Article | - |
dc.type.rims | ART | - |
dc.description.journalClass | 1 | - |
dc.identifier.wosid | 000856772000003 | - |
dc.identifier.doi | 10.1016/j.eswa.2022.118217 | - |
dc.identifier.bibliographicCitation | EXPERT SYSTEMS WITH APPLICATIONS, v.209 | - |
dc.description.isOpenAccess | N | - |
dc.identifier.scopusid | 2-s2.0-85135360064 | - |
dc.citation.title | EXPERT SYSTEMS WITH APPLICATIONS | - |
dc.citation.volume | 209 | - |
dc.contributor.affiliatedAuthor | Kang, Sangwoo | - |
dc.type.docType | Article | - |
dc.subject.keywordAuthor | Named entity recognition | - |
dc.subject.keywordAuthor | Social-media text mining | - |
dc.subject.keywordAuthor | Weakly labeled data | - |
dc.subject.keywordAuthor | Transfer learning | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalResearchArea | Operations Research & Management Science | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
dc.relation.journalWebOfScienceCategory | Operations Research & Management Science | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
1342, Seongnam-daero, Sujeong-gu, Seongnam-si, Gyeonggi-do, Republic of Korea(13120)031-750-5114
COPYRIGHT 2020 Gachon University All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.