Semi-Supervised Image Captioning by Adversarially Propagating Labeled Data

Kim, Dong-Jin; Oh, Tae-Hyun; Choi, Jinsoo; Kweon, In So

doi:10.1109/ACCESS.2024.3423790


Full metadata record

DC Field	Value	Language
dc.contributor.author	Kim, Dong-Jin	-
dc.contributor.author	Oh, Tae-Hyun	-
dc.contributor.author	Choi, Jinsoo	-
dc.contributor.author	Kweon, In So	-
dc.date.accessioned	2024-11-28T17:00:46Z	-
dc.date.available	2024-11-28T17:00:46Z	-
dc.date.issued	2024-07	-
dc.identifier.issn	2169-3536	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/197763	-
dc.description.abstract	We present a novel data-efficient semi-supervised framework to improve the generalization of image captioning models. Constructing a large-scale labeled image captioning dataset is expensive in terms of labor, time, and cost. In contrast to manually annotating all the training samples, separately collecting uni-modal datasets is immensely easier, e.g., a large-scale image dataset and a sentence dataset. We leverage such massive unpaired image and caption data upon standard paired data by learning to associate them. To this end, our proposed semi-supervised learning method assigns pseudo-labels to unpaired samples in an adversarial learning fashion, where the joint distribution of image and caption is learned. This approach shows noticeable performance improvement even in challenging scenarios, including out-of-task data and web-crawled data. We also show that our proposed method is theoretically well-motivated and has a favorable global optimal property. Our extensive and comprehensive empirical results on captioning datasets, followed by a comprehensive analysis of the scarcely-paired COCO dataset, demonstrate the consistent effectiveness of our method compared to competing ones.	-
dc.format.extent	13	-
dc.language	English	-
dc.language.iso	ENG	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.title	Semi-Supervised Image Captioning by Adversarially Propagating Labeled Data	-
dc.type	Article	-
dc.publisher.location	United States	-
dc.identifier.doi	10.1109/ACCESS.2024.3423790	-
dc.identifier.scopusid	2-s2.0-85197486437	-
dc.identifier.wosid	001269748800001	-
dc.identifier.bibliographicCitation	IEEE Access, v.12, pp 93580 - 93592	-
dc.citation.title	IEEE Access	-
dc.citation.volume	12	-
dc.citation.startPage	93580	-
dc.citation.endPage	93592	-
dc.type.docType	Article	-
dc.description.isOpenAccess	Y	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalResearchArea	Telecommunications	-
dc.relation.journalWebOfScienceCategory	Computer Science, Information Systems	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.relation.journalWebOfScienceCategory	Telecommunications	-
dc.subject.keywordPlus	Data visualization	-
dc.subject.keywordPlus	Generative adversarial networks	-
dc.subject.keywordPlus	Image enhancement	-
dc.subject.keywordPlus	Job analysis	-
dc.subject.keywordPlus	Web crawler	-
dc.subject.keywordAuthor	Bridges	-
dc.subject.keywordAuthor	Data models	-
dc.subject.keywordAuthor	generative adversarial networks	-
dc.subject.keywordAuthor	Image captioning	-
dc.subject.keywordAuthor	Natural languages	-
dc.subject.keywordAuthor	semi-supervised learning	-
dc.subject.keywordAuthor	Semisupervised learning	-
dc.subject.keywordAuthor	Task analysis	-
dc.subject.keywordAuthor	Training	-
dc.subject.keywordAuthor	unpaired captioning	-
dc.subject.keywordAuthor	Visualization	-
dc.identifier.url	https://ieeexplore.ieee.org/document/10586974	-


Appears in Collections: Seoul College of Engineering > ETC > 1. Journal Articles


Related Researcher

Kim, Dong Jin: COLLEGE OF ENGINEERING (DEPARTMENT OF INTELLIGENCE COMPUTING)


222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea, +82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
