Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Cross-domain Chinese Word Segmentation Based on New Word Discovery

Full metadata record
DC Field Value Language
dc.contributor.authorZhang, Jun-
dc.contributor.authorLai, Zhipeng-
dc.contributor.authorLi, Xue-
dc.contributor.authorNing, Gengxin-
dc.contributor.authorYang, Cui-
dc.date.accessioned2023-11-24T02:36:19Z-
dc.date.available2023-11-24T02:36:19Z-
dc.date.issued2022-09-
dc.identifier.issn1009-5896-
dc.identifier.urihttps://scholarworks.bwise.kr/erica/handle/2021.sw.erica/115726-
dc.description.abstractDeep Neural Network (DNN) is the major method in current Chinese word segmentation. However, its performance is significantly degraded when the network trained for one domain is used in other domains due to the Out Of Vocabulary (OOV) words and expression gaps. In this paper, a cross domain Chinese word segmentation system based on new word discovery is built to handle the OOV word and expression gap problems. An unsupervised new word discovery algorithm based on vector enhanced mutual information and weighted adjacency entropy, and a Chinese word segmentation model based on adversarial training are also proposed to improve the performance of the baseline system. Experimental results show that the proposed method is superior to the conventional methods in the OOV rates, precisions, recalls and F-scores. © 2022 Science Press. All rights reserved.-
dc.format.extent8-
dc.language중국어-
dc.language.isoCHI-
dc.publisherZhongguo Kexueyuan-
dc.titleCross-domain Chinese Word Segmentation Based on New Word Discovery-
dc.title.alternative基于新词发现的跨领域中文分词方法-
dc.typeArticle-
dc.publisher.location중국-
dc.identifier.doi10.11999/JEIT210675-
dc.identifier.scopusid2-s2.0-85139420044-
dc.identifier.wosid000889371900006-
dc.identifier.bibliographicCitationDianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, v.44, no.9, pp 3241 - 3248-
dc.citation.titleDianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology-
dc.citation.volume44-
dc.citation.number9-
dc.citation.startPage3241-
dc.citation.endPage3248-
dc.type.docTypeArticle-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.description.journalRegisteredClassesci-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.subject.keywordAuthorAdversarial training-
dc.subject.keywordAuthorChinese word segmentation-
dc.subject.keywordAuthorCross-domain-
dc.subject.keywordAuthorNew word discovery-
dc.subject.keywordAuthorVector enhancement mutual information-
dc.identifier.urlhttps://jeit.ac.cn/en/article/doi/10.11999/JEIT210675-
Files in This Item
Go to Link
Appears in
Collections
COLLEGE OF ENGINEERING SCIENCES > SCHOOL OF ELECTRICAL ENGINEERING > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher ZHANG, Jun photo

ZHANG, Jun
ERICA 공학대학 (SCHOOL OF ELECTRICAL ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE