Generative Bias for Robust Visual Question Answering

Cho, Jae Won; Kim, Dong-Jin; Ryu, Hyeonggon; Kweon, In So

doi:10.1109/CVPR52729.2023.01124

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Generative Bias for Robust Visual Question Answering

Full metadata record

DC Field	Value	Language
dc.contributor.author	Cho, Jae Won	-
dc.contributor.author	Kim, Dong-Jin	-
dc.contributor.author	Ryu, Hyeonggon	-
dc.contributor.author	Kweon, In So	-
dc.date.accessioned	2023-09-11T01:51:50Z	-
dc.date.available	2023-09-11T01:51:50Z	-
dc.date.issued	2023-08	-
dc.identifier.issn	1063-6919	-
dc.identifier.issn	2575-7075	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/190374	-
dc.description.abstract	The task of Visual Question Answering (VQA) is knownto be plagued by the issue of VQA models exploiting biases within the dataset to make its final prediction. Variousprevious ensemble based debiasing methods have been proposed where an additional model is purposefully trained tobe biased in order to train a robust target model. However, these methods compute the bias for a model simplyfrom the label statistics of the training data or from singlemodal branches. In this work, in order to better learn thebias a target VQA model suffers from, we propose a generative method to train the bias model directly from the targetmodel, called GenB. In particular, GenB employs a generative network to learn the bias in the target model througha combination of the adversarial objective and knowledgedistillation. We then debias our target model with GenB asa bias model, and show through extensive experiments theeffects of our method on various VQA bias datasets including VQA-CP2, VQA-CP1, GQA-OOD, and VQA-CE, andshow state-of-the-art results with the LXMERT architectureon VQA-CP2.	-
dc.format.extent	10	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	IEEE COMPUTER SOC	-
dc.title	Generative Bias for Robust Visual Question Answering	-
dc.type	Article	-
dc.publisher.location	미국	-
dc.identifier.doi	10.1109/CVPR52729.2023.01124	-
dc.identifier.scopusid	2-s2.0-85210117401	-
dc.identifier.wosid	001062522103095	-
dc.identifier.bibliographicCitation	2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), pp 11681 - 11690	-
dc.citation.title	2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)	-
dc.citation.startPage	11681	-
dc.citation.endPage	11690	-
dc.type.docType	Proceedings Paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalWebOfScienceCategory	Computer Science, Artificial Intelligence	-
dc.subject.keywordPlus	Visual languages	-
dc.subject.keywordAuthor	language	-
dc.subject.keywordAuthor	reasoning	-
dc.subject.keywordAuthor	Vision	-
dc.identifier.url	https://ieeexplore.ieee.org/document/10205250	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > ETC > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Kim, Dong Jin photo

Kim, Dong Jin: COLLEGE OF ENGINEERING (DEPARTMENT OF INTELLIGENCE COMPUTING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE