Focusing on valid search space in Open-World Compositional Zero-Shot Learning by leveraging misleading answers.

Kim, Soohyeong; Lee, Sangjun; Choi, Yong Suk

doi:10.1109/ACCESS.2024.3491174

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Focusing on valid search space in Open-World Compositional Zero-Shot Learning by leveraging misleading answers.

Full metadata record

DC Field	Value	Language
dc.contributor.author	Kim, Soohyeong	-
dc.contributor.author	Lee, Sangjun	-
dc.contributor.author	Choi, Yong Suk	-
dc.date.accessioned	2024-11-29T08:00:15Z	-
dc.date.available	2024-11-29T08:00:15Z	-
dc.date.issued	2024-11	-
dc.identifier.issn	2169-3536	-
dc.identifier.issn	2169-3536	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/198591	-
dc.description.abstract	The goal of Compositional Zero-Shot Learning (CZSL) is to recognize various compositions of state-object pairs. Because the compositions that need to be considered are only a subset of all combinations of states and objects, it is tough for models to predict unseen compositions. Previous work overlooks the problem of predicting non-sensical compositions such as flying dogs. To address this problem, we introduce a novel method for the model to distinguish between target and non-target composition space to avoid predicting absurd compositions. More specifically, in the process of predicting the states and objects, we train the model to increase the similarity with the label that matches the input image while decreasing the similarity with non-matched labels. Our method calculates the logits for the composition labels by combining the similarities of the image-states and the similarities of image-objects respectively. Then, the combined logits and directly computed composition logits are used to minimize the case of the predicting absurd composition. On three well-known datasets such as MIT-States, UT-Zappos, and C-GQA, various experimental results demonstrate our simple and novel approach significantly improves model performances. Code is available at: https://github.com/ToBeSuperior/Annotation-embedding.	-
dc.format.extent	9	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.title	Focusing on valid search space in Open-World Compositional Zero-Shot Learning by leveraging misleading answers.	-
dc.type	Article	-
dc.publisher.location	미국	-
dc.identifier.doi	10.1109/ACCESS.2024.3491174	-
dc.identifier.scopusid	2-s2.0-85208665029	-
dc.identifier.wosid	001354635200001	-
dc.identifier.bibliographicCitation	IEEE Access, v.12, pp 165822 - 165830	-
dc.citation.title	IEEE Access	-
dc.citation.volume	12	-
dc.citation.startPage	165822	-
dc.citation.endPage	165830	-
dc.type.docType	Article	-
dc.description.isOpenAccess	Y	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalResearchArea	Telecommunications	-
dc.relation.journalWebOfScienceCategory	Computer Science, Information Systems	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.relation.journalWebOfScienceCategory	Telecommunications	-
dc.subject.keywordPlus	Adversarial machine learning	-
dc.subject.keywordPlus	C (programming language)	-
dc.subject.keywordPlus	Contrastive Learning	-
dc.subject.keywordPlus	Image annotation	-
dc.subject.keywordPlus	Prediction models	-
dc.subject.keywordAuthor	Compositional Zero-Shot Learning	-
dc.subject.keywordAuthor	Open-World recognition	-
dc.subject.keywordAuthor	Representation Learning	-
dc.subject.keywordAuthor	Vision and Language	-
dc.subject.keywordAuthor	Zero-Shot Learning	-
dc.identifier.url	https://ieeexplore.ieee.org/document/10742371	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Choi, Yong Suk photo

Choi, Yong Suk: COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE