Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

RefCap: image captioning with referent objects attributes

Full metadata record
DC Field Value Language
dc.contributor.authorPark, Seokmok-
dc.contributor.authorPaik, Joonki-
dc.date.accessioned2024-01-23T02:00:21Z-
dc.date.available2024-01-23T02:00:21Z-
dc.date.issued2023-12-
dc.identifier.issn2045-2322-
dc.identifier.urihttps://scholarworks.bwise.kr/cau/handle/2019.sw.cau/71240-
dc.description.abstractIn recent years, significant progress has been made in visual-linguistic multi-modality research, leading to advancements in visual comprehension and its applications in computer vision tasks. One fundamental task in visual-linguistic understanding is image captioning, which involves generating human-understandable textual descriptions given an input image. This paper introduces a referring expression image captioning model that incorporates the supervision of interesting objects. Our model utilizes user-specified object keywords as a prefix to generate specific captions that are relevant to the target object. The model consists of three modules including: (i) visual grounding, (ii) referring object selection, and (iii) image captioning modules. To evaluate its performance, we conducted experiments on the RefCOCO and COCO captioning datasets. The experimental results demonstrate that our proposed method effectively generates meaningful captions aligned with users’ specific interests. © 2023, The Author(s).-
dc.language영어-
dc.language.isoENG-
dc.publisherNature Research-
dc.titleRefCap: image captioning with referent objects attributes-
dc.typeArticle-
dc.identifier.doi10.1038/s41598-023-48916-6-
dc.identifier.bibliographicCitationScientific Reports, v.13, no.1-
dc.description.isOpenAccessY-
dc.identifier.wosid001142614900042-
dc.identifier.scopusid2-s2.0-85178945681-
dc.citation.number1-
dc.citation.titleScientific Reports-
dc.citation.volume13-
dc.type.docTypeArticle-
dc.publisher.location영국-
dc.relation.journalResearchAreaScience & Technology - Other Topics-
dc.relation.journalWebOfScienceCategoryMultidisciplinary Sciences-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
Files in This Item
Appears in
Collections
Graduate School of Advanced Imaging Sciences, Multimedia and Film > Department of Imaging Science and Arts > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Paik, Joon Ki photo

Paik, Joon Ki
첨단영상대학원 (영상학과)
Read more

Altmetrics

Total Views & Downloads

BROWSE