Active learning with logistic models featuring simultaneous variable and subject selection
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Wang, Zhanfeng | - |
dc.contributor.author | Kwon, Amy M. | - |
dc.contributor.author | Chang, Yuan-chin Ivan | - |
dc.date.accessioned | 2023-09-11T01:36:35Z | - |
dc.date.available | 2023-09-11T01:36:35Z | - |
dc.date.issued | 2022-02 | - |
dc.identifier.issn | 0361-0918 | - |
dc.identifier.issn | 1532-4141 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/115236 | - |
dc.description.abstract | Modern computing and communication technologies can make data collection procedures very efficient, while our ability to analyze and extract information from large data sets is hard-pressed to keep up with our capacity for data collection. If we are interested in learning a classification/prediction rule that was not considered in the original data collection procedure, then there is usually a lack of label information in the original data. Thus, how to start with minimum number of labeled data and aggressively selected most informative samples for being labeled becomes an important issue. The thoughts of active learning, with subjects to be selected sequentially without using label information, is a possible outlet for this situation. In addition, if we can identify variables, from the lengthy variable list of the data set, for constructing an effective classification rule with good interpretation ability will be better. Here, we propose an active learning procedure targeted the maximum area under the receiver operating characteristic curve via a commonly use logistic model, which can sequentially select the effective informative subjects and variables, simultaneously. We discuss the asymptotic properties of the proposed procedure and illustrate it with some synthesized and real data sets. © 2022 Taylor & Francis Group, LLC. | - |
dc.format.extent | 15 | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | Dekker | - |
dc.title | Active learning with logistic models featuring simultaneous variable and subject selection | - |
dc.type | Article | - |
dc.publisher.location | 미국 | - |
dc.identifier.doi | 10.1080/03610918.2022.2037636 | - |
dc.identifier.scopusid | 2-s2.0-85125246304 | - |
dc.identifier.wosid | 000756469900001 | - |
dc.identifier.bibliographicCitation | Communications in Statistics Part B: Simulation and Computation, v.53, no.2, pp 1 - 15 | - |
dc.citation.title | Communications in Statistics Part B: Simulation and Computation | - |
dc.citation.volume | 53 | - |
dc.citation.number | 2 | - |
dc.citation.startPage | 1 | - |
dc.citation.endPage | 15 | - |
dc.type.docType | Article; Early Access | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Mathematics | - |
dc.relation.journalWebOfScienceCategory | Statistics & Probability | - |
dc.subject.keywordPlus | GENERALIZED LINEAR-MODELS | - |
dc.subject.keywordAuthor | Active learning | - |
dc.subject.keywordAuthor | Area under ROC curve | - |
dc.subject.keywordAuthor | Classification | - |
dc.subject.keywordAuthor | Local optimal design | - |
dc.subject.keywordAuthor | Shrinkage estimate | - |
dc.subject.keywordAuthor | Stopping time | - |
dc.identifier.url | https://www.tandfonline.com/doi/full/10.1080/03610918.2022.2037636 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr
COPYRIGHT © 2021 HANYANG UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.