Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Generative Bias for Robust Visual Question Answering

Full metadata record
DC Field Value Language
dc.contributor.authorCho, Jae Won-
dc.contributor.authorKim, Dong-Jin-
dc.contributor.authorRyu, Hyeonggon-
dc.contributor.authorKweon, In So-
dc.date.accessioned2023-09-11T01:51:50Z-
dc.date.available2023-09-11T01:51:50Z-
dc.date.issued2023-08-
dc.identifier.issn1063-6919-
dc.identifier.issn2575-7075-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/190374-
dc.description.abstractThe task of Visual Question Answering (VQA) is knownto be plagued by the issue of VQA models exploiting biases within the dataset to make its final prediction. Variousprevious ensemble based debiasing methods have been proposed where an additional model is purposefully trained tobe biased in order to train a robust target model. However, these methods compute the bias for a model simplyfrom the label statistics of the training data or from singlemodal branches. In this work, in order to better learn thebias a target VQA model suffers from, we propose a generative method to train the bias model directly from the targetmodel, called GenB. In particular, GenB employs a generative network to learn the bias in the target model througha combination of the adversarial objective and knowledgedistillation. We then debias our target model with GenB asa bias model, and show through extensive experiments theeffects of our method on various VQA bias datasets including VQA-CP2, VQA-CP1, GQA-OOD, and VQA-CE, andshow state-of-the-art results with the LXMERT architectureon VQA-CP2.-
dc.format.extent10-
dc.language영어-
dc.language.isoENG-
dc.publisherIEEE COMPUTER SOC-
dc.titleGenerative Bias for Robust Visual Question Answering-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1109/CVPR52729.2023.01124-
dc.identifier.scopusid2-s2.0-85210117401-
dc.identifier.wosid001062522103095-
dc.identifier.bibliographicCitation2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), pp 11681 - 11690-
dc.citation.title2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)-
dc.citation.startPage11681-
dc.citation.endPage11690-
dc.type.docTypeProceedings Paper-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.subject.keywordPlusVisual languages-
dc.subject.keywordAuthorlanguage-
dc.subject.keywordAuthorreasoning-
dc.subject.keywordAuthorVision-
dc.identifier.urlhttps://ieeexplore.ieee.org/document/10205250-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Dong Jin photo

Kim, Dong Jin
COLLEGE OF ENGINEERING (DEPARTMENT OF INTELLIGENCE COMPUTING)
Read more

Altmetrics

Total Views & Downloads

BROWSE