Detailed Information

Cited 11 times in Web of Science; cited 13 times in Scopus

Binary-Classifiers-Enabled Filters for Semi-Supervised Learning

Authors
Kumar, T.; Park, J.; Ali, M.S.; Shahab, Uddin A.F.M.; Ko, J.H.; Bae, S.
Issue Date
2021
Publisher
Institute of Electrical and Electronics Engineers Inc.
Keywords
Audio classification; binary classification; image classification; semi-supervised learning; text classification
Citation
IEEE Access, v.9, pp.167663 - 167673
Indexed
SCIE
SCOPUS
Journal Title
IEEE Access
Volume
9
Start Page
167663
End Page
167673
URI
https://scholarworks.bwise.kr/skku/handle/2021.sw.skku/92777
DOI
10.1109/ACCESS.2021.3124200
ISSN
2169-3536
Abstract
A typical semi-supervised learning (SSL) scheme trains a single model on labeled data and uses pseudo-labeling to obtain labels for unlabeled data. However, samples are often filtered during pseudo-labeling using a probability threshold, which makes effective threshold selection a challenge. With a high probability threshold, correct samples may not be labeled; with a low threshold, samples can be wrongly labeled. This threshold issue degrades the overall performance of the model. This paper addresses this issue by proposing a novel SSL approach named Binary-Classifiers-Enabled Filters for Semi-Supervised Learning (BSSL), which labels the unlabeled data by using binary classifiers as data filters. That is, we train binary classifiers dedicated to each class. After training, we propose three methods for labeling the unlabeled data: cascading, non-cascading, and rank-based binary classifiers. Our extensive experiments show that rank-based binary classifiers are the best choice for labeling the data. Our approach eliminates threshold selection to improve the performance of the model. Comprehensive experiments demonstrate the effectiveness of our approach across a variety of domains, including image classification, text classification, and audio classification, on datasets including MNIST, Fashion-MNIST, EuroSAT, ESC-10, the Free Spoken Digit Dataset, Audio Emotion Recognition, Reuters, and the Mice Protein dataset. The rank-based binary classifiers (BSSL) approach achieves an absolute performance improvement of at least 10% and 5% over supervised learning (SL) and SSL, respectively, on audio datasets across different numbers of samples, except on the RAVDESS dataset. Moreover, BSSL performs especially well on image datasets when the number of samples is very small. Overall, BSSL outperformed the purely supervised learning approach and SSL pseudo-labeling approaches across different numbers of samples.
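
The threshold trade-off described in the abstract, and the idea of labeling from per-class binary classifier scores, can be illustrated with a minimal sketch. The abstract does not specify how the cascading, non-cascading, or rank-based methods are implemented, so the function names, the example scores, and the "take the top-ranked class" rule below are assumptions for illustration only, not the authors' method.

```python
from typing import List, Optional, Sequence


def threshold_pseudo_labels(
    probs: Sequence[Sequence[float]], threshold: float
) -> List[Optional[int]]:
    """Baseline pseudo-labeling: keep a sample only if its highest
    class probability reaches the threshold; otherwise leave it unlabeled."""
    labels: List[Optional[int]] = []
    for row in probs:
        best = max(range(len(row)), key=lambda c: row[c])
        labels.append(best if row[best] >= threshold else None)
    return labels


def rank_based_pseudo_labels(
    binary_scores: Sequence[Sequence[float]]
) -> List[int]:
    """Hypothetical rank-based rule (an assumption, not taken from the paper):
    each column holds the score of one class-specific binary classifier;
    classes are ranked by score and the top-ranked class becomes the label,
    so no probability threshold has to be chosen."""
    labels: List[int] = []
    for row in binary_scores:
        ranking = sorted(range(len(row)), key=lambda c: row[c], reverse=True)
        labels.append(ranking[0])
    return labels


# Made-up scores for three unlabeled samples and three classes.
# Rows need not sum to 1 because each column is an independent binary classifier.
scores = [
    [0.95, 0.03, 0.02],
    [0.20, 0.35, 0.10],
    [0.10, 0.30, 0.60],
]

high = threshold_pseudo_labels(scores, threshold=0.9)
low = threshold_pseudo_labels(scores, threshold=0.3)
ranked = rank_based_pseudo_labels(scores)
```

With the high threshold only the first sample receives a label, while the low threshold labels every sample regardless of how weak the evidence is; the rank-based sketch avoids choosing a threshold altogether.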
Appears in Collections
Information and Communication Engineering > School of Electronic and Electrical Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

KO, JONG HWAN
Information and Communication Engineering (Electronic and Electrical Engineering)