Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Semi-Supervised Image Captioning by Adversarially Propagating Labeled Dataopen access

Authors
Kim, Dong-JinOh, Tae-HyunChoi, JinsooKweon, In So
Issue Date
Jul-2024
Publisher
Institute of Electrical and Electronics Engineers Inc.
Keywords
Bridges; Data models; generative adversarial networks; Image captioning; Natural languages; semi-supervised learning; Semisupervised learning; Task analysis; Training; unpaired captioning; Visualization
Citation
IEEE Access, v.12, pp 93580 - 93592
Pages
13
Indexed
SCIE
SCOPUS
Journal Title
IEEE Access
Volume
12
Start Page
93580
End Page
93592
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/197763
DOI
10.1109/ACCESS.2024.3423790
ISSN
2169-3536
2169-3536
Abstract
We present a novel data-efficient <italic>semi-supervised</italic> framework to improve the generalization of image captioning models. Constructing a large-scale labeled image captioning dataset is expensive in terms of labor, time, and cost. In contrast to manually annotating all the training samples, separately collecting uni-modal datasets is immensely easier, <italic>e.g</italic>. a large-scale image dataset and a sentence dataset.We leverage such massive <italic>unpaired</italic> image and caption data upon standard paired data by learning to associate them. To this end, our proposed semi-supervised learning method assigns pseudo-labels to unpaired samples in an adversarial learning fashion, where the joint distribution of image and caption is learned. This approach shows noticeable performance improvement even in challenging scenarios, including out-of-task data and web-crawled data. We also show that our proposed method is theoretically well-motivated and has a favorable global optimal property. Our extensive and comprehensive empirical results on captioning datasets, followed by a comprehensive analysis of the scarcely-paired COCO dataset, demonstrate the consistent effectiveness of our method compared to competing ones.
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Dong Jin photo

Kim, Dong Jin
COLLEGE OF ENGINEERING (DEPARTMENT OF INTELLIGENCE COMPUTING)
Read more

Altmetrics

Total Views & Downloads

BROWSE