Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Focusing on valid search space in Open-World Compositional Zero-Shot Learning by leveraging misleading answers.

Full metadata record
DC Field Value Language
dc.contributor.authorKim, Soohyeong-
dc.contributor.authorLee, Sangjun-
dc.contributor.authorChoi, Yong Suk-
dc.date.accessioned2024-11-29T08:00:15Z-
dc.date.available2024-11-29T08:00:15Z-
dc.date.issued2024-11-
dc.identifier.issn2169-3536-
dc.identifier.issn2169-3536-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/198591-
dc.description.abstractThe goal of Compositional Zero-Shot Learning (CZSL) is to recognize various compositions of state-object pairs. Because the compositions that need to be considered are only a subset of all combinations of states and objects, it is tough for models to predict unseen compositions. Previous work overlooks the problem of predicting non-sensical compositions such as flying dogs. To address this problem, we introduce a novel method for the model to distinguish between target and non-target composition space to avoid predicting absurd compositions. More specifically, in the process of predicting the states and objects, we train the model to increase the similarity with the label that matches the input image while decreasing the similarity with non-matched labels. Our method calculates the logits for the composition labels by combining the similarities of the image-states and the similarities of image-objects respectively. Then, the combined logits and directly computed composition logits are used to minimize the case of the predicting absurd composition. On three well-known datasets such as MIT-States, UT-Zappos, and C-GQA, various experimental results demonstrate our simple and novel approach significantly improves model performances. Code is available at: https://github.com/ToBeSuperior/Annotation-embedding.-
dc.format.extent9-
dc.language영어-
dc.language.isoENG-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleFocusing on valid search space in Open-World Compositional Zero-Shot Learning by leveraging misleading answers.-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1109/ACCESS.2024.3491174-
dc.identifier.scopusid2-s2.0-85208665029-
dc.identifier.wosid001354635200001-
dc.identifier.bibliographicCitationIEEE Access, v.12, pp 165822 - 165830-
dc.citation.titleIEEE Access-
dc.citation.volume12-
dc.citation.startPage165822-
dc.citation.endPage165830-
dc.type.docTypeArticle-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaTelecommunications-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryTelecommunications-
dc.subject.keywordPlusAdversarial machine learning-
dc.subject.keywordPlusC (programming language)-
dc.subject.keywordPlusContrastive Learning-
dc.subject.keywordPlusImage annotation-
dc.subject.keywordPlusPrediction models-
dc.subject.keywordAuthorCompositional Zero-Shot Learning-
dc.subject.keywordAuthorOpen-World recognition-
dc.subject.keywordAuthorRepresentation Learning-
dc.subject.keywordAuthorVision and Language-
dc.subject.keywordAuthorZero-Shot Learning-
dc.identifier.urlhttps://ieeexplore.ieee.org/document/10742371-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Choi, Yong Suk photo

Choi, Yong Suk
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)
Read more

Altmetrics

Total Views & Downloads

BROWSE