Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

LMGAN: Linguistically Informed Semi-Supervised GAN with Multiple Generatorsopen access

Authors
Cho, WhanheeChoi, Yong Suk
Issue Date
Nov-2022
Publisher
MDPI
Keywords
semi-supervised GAN; semi-supervised learning; text classification
Citation
SENSORS, v.22, no.22, pp 1 - 17
Pages
17
Indexed
SCIE
SCOPUS
Journal Title
SENSORS
Volume
22
Number
22
Start Page
1
End Page
17
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/172853
DOI
10.3390/s22228761
ISSN
1424-8220
1424-8220
Abstract
Semi-supervised learning is one of the active research topics these days. There is a trial that solves semi-supervised text classification with a generative adversarial network (GAN). However, its generator has a limitation in producing fake data distributions that are similar to real data distributions. Since the real data distribution is frequently changing, the generator could not create adequate fake data. To overcome this problem, we present a novel approach for semi-supervised learning for text classification based on generative adversarial networks, Linguistically Informed SeMi-Supervised GAN with Multiple Generators, LMGAN. LMGAN uses trained bidirectional encoder representations from transformers (BERT) and the discriminator from GAN-BERT. In addition, LMGAN has multiple generators and utilizes the hidden layers of BERT. To reduce the discrepancy between the distribution of fake data and real data distribution, LMGAN uses fine-tuned BERT and the discriminator from GAN-BERT. However, since injecting fine-tuned BERT could induce incorrect fake data distribution, we utilize linguistically meaningful intermediate hidden layer outputs of BERT to enrich fake data distribution. Our model shows well-distributed fake data compared to the earlier GAN-based approach that failed to generate adequate high-quality fake data. Moreover, we can get better performances with extremely limited amounts of labeled data, up to 20.0%, compared to the baseline GAN-based model.
Files in This Item
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Choi, Yong Suk photo

Choi, Yong Suk
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)
Read more

Altmetrics

Total Views & Downloads

BROWSE