Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

CFP-AL: Combining Model Features and Prediction for Active Learning in Sentence Classification

Full metadata record
DC Field Value Language
dc.contributor.authorKim, Keuntae-
dc.contributor.authorChoi, Yong Suk-
dc.date.accessioned2025-02-04T08:00:10Z-
dc.date.available2025-02-04T08:00:10Z-
dc.date.issued2025-01-
dc.identifier.issn2076-3417-
dc.identifier.issn2076-3417-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/206366-
dc.description.abstractActive learning has been a research area conducted across various domains for a long time, from traditional machine learning to the latest deep learning research. Particularly, obtaining high-quality labeled datasets for supervised learning requires human annotation, and an effective active learning strategy can greatly reduce annotation costs. In this study, we propose a new insight, CFP-AL (Combining model Features and Prediction for Active Learning), from the perspective of feature space by analyzing and diagnosing methods that have shown good performance in NLP (Natural Language Processing) sentence classification. According to our analysis, while previous active learning strategies that focus on finding data near the decision boundary to facilitate classifier tuning are effective, there are very few data points near the decision boundary. Therefore, a more detailed active learning strategy is needed beyond simply finding data near the decision boundary or data with high uncertainty. Based on this analysis, we propose CFP-AL, which considers the model's feature space, and it demonstrated the best performance across six tasks and also outperformed others in three Out-Of-Domain (OOD) tasks. While suggesting that data sampling through CFP-AL is the most differential classification standard, it showed novelty in suggesting a method to overcome the anisotropy phenomenon of supervised models. Additionally, through various comparative experiments with basic methods, we analyzed which data are most beneficial or harmful for model training. Through our research, researchers will be able to expand into the area of considering features in active learning, which has been difficult so far.-
dc.format.extent15-
dc.language영어-
dc.language.isoENG-
dc.publisherMDPI-
dc.titleCFP-AL: Combining Model Features and Prediction for Active Learning in Sentence Classification-
dc.typeArticle-
dc.publisher.location스위스-
dc.identifier.doi10.3390/app15010482-
dc.identifier.scopusid2-s2.0-85214518196-
dc.identifier.wosid001393444000001-
dc.identifier.bibliographicCitationApplied Sciences-basel, v.15, no.1, pp 1 - 15-
dc.citation.titleApplied Sciences-basel-
dc.citation.volume15-
dc.citation.number1-
dc.citation.startPage1-
dc.citation.endPage15-
dc.type.docTypeArticle-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaChemistry-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaMaterials Science-
dc.relation.journalResearchAreaPhysics-
dc.relation.journalWebOfScienceCategoryChemistry, Multidisciplinary-
dc.relation.journalWebOfScienceCategoryEngineering, Multidisciplinary-
dc.relation.journalWebOfScienceCategoryMaterials Science, Multidisciplinary-
dc.relation.journalWebOfScienceCategoryPhysics, Applied-
dc.subject.keywordAuthorNLP (Natural Language Processing)-
dc.subject.keywordAuthorsupervised fine-tuning-
dc.subject.keywordAuthoractive learning-
dc.subject.keywordAuthorsentence classification-
dc.identifier.urlhttps://www.mdpi.com/2076-3417/15/1/482-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Choi, Yong Suk photo

Choi, Yong Suk
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)
Read more

Altmetrics

Total Views & Downloads

BROWSE