Generative Bias for Robust Visual Question Answering
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Cho, Jae Won | - |
| dc.contributor.author | Kim, Dong-Jin | - |
| dc.contributor.author | Ryu, Hyeonggon | - |
| dc.contributor.author | Kweon, In So | - |
| dc.date.accessioned | 2023-09-11T01:51:50Z | - |
| dc.date.available | 2023-09-11T01:51:50Z | - |
| dc.date.issued | 2023-08 | - |
| dc.identifier.issn | 1063-6919 | - |
| dc.identifier.issn | 2575-7075 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/190374 | - |
| dc.description.abstract | The task of Visual Question Answering (VQA) is known to be plagued by the issue of VQA models exploiting biases within the dataset to make its final prediction. Various previous ensemble based debiasing methods have been proposed where an additional model is purposefully trained to be biased in order to train a robust target model. However, these methods compute the bias for a model simply from the label statistics of the training data or from single modal branches. In this work, in order to better learn the bias a target VQA model suffers from, we propose a generative method to train the bias model directly from the target model, called GenB. In particular, GenB employs a generative network to learn the bias in the target model through a combination of the adversarial objective and knowledge distillation. We then debias our target model with GenB as a bias model, and show through extensive experiments the effects of our method on various VQA bias datasets including VQA-CP2, VQA-CP1, GQA-OOD, and VQA-CE, and show state-of-the-art results with the LXMERT architecture on VQA-CP2. | - |
| dc.format.extent | 10 | - |
| dc.language | English | - |
| dc.language.iso | ENG | - |
| dc.publisher | IEEE COMPUTER SOC | - |
| dc.title | Generative Bias for Robust Visual Question Answering | - |
| dc.type | Article | - |
| dc.publisher.location | United States | - |
| dc.identifier.doi | 10.1109/CVPR52729.2023.01124 | - |
| dc.identifier.scopusid | 2-s2.0-85210117401 | - |
| dc.identifier.wosid | 001062522103095 | - |
| dc.identifier.bibliographicCitation | 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), pp 11681 - 11690 | - |
| dc.citation.title | 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | - |
| dc.citation.startPage | 11681 | - |
| dc.citation.endPage | 11690 | - |
| dc.type.docType | Proceedings Paper | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
| dc.subject.keywordPlus | Visual languages | - |
| dc.subject.keywordAuthor | language | - |
| dc.subject.keywordAuthor | reasoning | - |
| dc.subject.keywordAuthor | Vision | - |
| dc.identifier.url | https://ieeexplore.ieee.org/document/10205250 | - |
