Cited 0 time in
Semi-Supervised Image Captioning by Adversarially Propagating Labeled Data
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Kim, Dong-Jin | - |
| dc.contributor.author | Oh, Tae-Hyun | - |
| dc.contributor.author | Choi, Jinsoo | - |
| dc.contributor.author | Kweon, In So | - |
| dc.date.accessioned | 2024-11-28T17:00:46Z | - |
| dc.date.available | 2024-11-28T17:00:46Z | - |
| dc.date.issued | 2024-07 | - |
| dc.identifier.issn | 2169-3536 | - |
| dc.identifier.issn | 2169-3536 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/197763 | - |
| dc.description.abstract | We present a novel data-efficient <italic>semi-supervised</italic> framework to improve the generalization of image captioning models. Constructing a large-scale labeled image captioning dataset is expensive in terms of labor, time, and cost. In contrast to manually annotating all the training samples, separately collecting uni-modal datasets is immensely easier, <italic>e.g</italic>. a large-scale image dataset and a sentence dataset.We leverage such massive <italic>unpaired</italic> image and caption data upon standard paired data by learning to associate them. To this end, our proposed semi-supervised learning method assigns pseudo-labels to unpaired samples in an adversarial learning fashion, where the joint distribution of image and caption is learned. This approach shows noticeable performance improvement even in challenging scenarios, including out-of-task data and web-crawled data. We also show that our proposed method is theoretically well-motivated and has a favorable global optimal property. Our extensive and comprehensive empirical results on captioning datasets, followed by a comprehensive analysis of the scarcely-paired COCO dataset, demonstrate the consistent effectiveness of our method compared to competing ones. | - |
| dc.format.extent | 13 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
| dc.title | Semi-Supervised Image Captioning by Adversarially Propagating Labeled Data | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.1109/ACCESS.2024.3423790 | - |
| dc.identifier.scopusid | 2-s2.0-85197486437 | - |
| dc.identifier.wosid | 001269748800001 | - |
| dc.identifier.bibliographicCitation | IEEE Access, v.12, pp 93580 - 93592 | - |
| dc.citation.title | IEEE Access | - |
| dc.citation.volume | 12 | - |
| dc.citation.startPage | 93580 | - |
| dc.citation.endPage | 93592 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalResearchArea | Telecommunications | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
| dc.relation.journalWebOfScienceCategory | Telecommunications | - |
| dc.subject.keywordPlus | Data visualization | - |
| dc.subject.keywordPlus | Generative adversarial networks | - |
| dc.subject.keywordPlus | Image enhancement | - |
| dc.subject.keywordPlus | Job analysis | - |
| dc.subject.keywordPlus | Web crawler | - |
| dc.subject.keywordAuthor | Bridges | - |
| dc.subject.keywordAuthor | Data models | - |
| dc.subject.keywordAuthor | generative adversarial networks | - |
| dc.subject.keywordAuthor | Image captioning | - |
| dc.subject.keywordAuthor | Natural languages | - |
| dc.subject.keywordAuthor | semi-supervised learning | - |
| dc.subject.keywordAuthor | Semisupervised learning | - |
| dc.subject.keywordAuthor | Task analysis | - |
| dc.subject.keywordAuthor | Training | - |
| dc.subject.keywordAuthor | unpaired captioning | - |
| dc.subject.keywordAuthor | Visualization | - |
| dc.identifier.url | https://ieeexplore.ieee.org/document/10586974 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
