Hypergraph Transformer: Weakly-Supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering

Full metadata record
DC Field: Value
dc.contributor.author: Heo, Yu-Jung
dc.contributor.author: Kim, Eun-Sol
dc.contributor.author: Choi, Woo Suk
dc.contributor.author: Zhang, Byoung-Tak
dc.date.accessioned: 2022-10-25T07:40:35Z
dc.date.available: 2022-10-25T07:40:35Z
dc.date.created: 2022-10-06
dc.date.issued: 2022-09
dc.identifier.uri: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/172562
dc.description.abstract: Knowledge-based visual question answering (QA) aims to answer a question that requires visually grounded external knowledge beyond the image content itself. Answering complex questions that require multi-hop reasoning under weak supervision is considered a challenging problem because i) no supervision is given to the reasoning process and ii) high-order semantics of multi-hop knowledge facts need to be captured. In this paper, we introduce the concept of a hypergraph to encode high-level semantics of a question and a knowledge base, and to learn high-order associations between them. The proposed model, Hypergraph Transformer, constructs a question hypergraph and a query-aware knowledge hypergraph, and infers an answer by encoding inter-associations between the two hypergraphs and intra-associations within each hypergraph. Extensive experiments on two knowledge-based visual QA and two knowledge-based textual QA benchmarks demonstrate the effectiveness of our method, especially on multi-hop reasoning problems. Our source code is available at https://github.com/yujungheo/kbvqa-public.
dc.language: English
dc.language.iso: en
dc.publisher: ASSOC COMPUTATIONAL LINGUISTICS-ACL
dc.title: Hypergraph Transformer: Weakly-Supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering
dc.type: Article
dc.contributor.affiliatedAuthor: Kim, Eun-Sol
dc.identifier.wosid: 000828702300029
dc.identifier.bibliographicCitation: PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), pp.373 - 390
dc.relation.isPartOf: PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS)
dc.citation.title: PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS)
dc.citation.startPage: 373
dc.citation.endPage: 390
dc.type.rims: ART
dc.type.docType: Proceedings Paper
dc.description.journalClass: 3
dc.description.isOpenAccess: N
dc.relation.journalResearchArea: Computer Science
dc.relation.journalResearchArea: Linguistics
dc.relation.journalWebOfScienceCategory: Computer Science, Artificial Intelligence
dc.relation.journalWebOfScienceCategory: Computer Science, Interdisciplinary Applications
dc.relation.journalWebOfScienceCategory: Linguistics
Files in This Item
There are no files associated with this item.
Appears in Collections
Seoul College of Engineering > Seoul School of Computer Science > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Kim, Eun Sol
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)