Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Domain Generalization by Mutual-Information Regularization with Pre-trained Models

Full metadata record
DC Field Value Language
dc.contributor.authorCha, J.-
dc.contributor.authorLee, Kyungjae-
dc.contributor.authorPark, S.-
dc.contributor.authorChun, S.-
dc.date.accessioned2023-03-08T05:10:06Z-
dc.date.available2023-03-08T05:10:06Z-
dc.date.issued2022-10-
dc.identifier.issn0302-9743-
dc.identifier.issn1611-3349-
dc.identifier.urihttps://scholarworks.bwise.kr/cau/handle/2019.sw.cau/61189-
dc.description.abstractDomain generalization (DG) aims to learn a generalized model to an unseen target domain using only limited source domains. Previous attempts to DG fail to learn domain-invariant representations only from the source domains due to the significant domain shifts between training and test domains. Instead, we re-formulate the DG objective using mutual information with the oracle model, a model generalized to any possible domain. We derive a tractable variational lower bound via approximating the oracle model by a pre-trained model, called Mutual Information Regularization with Oracle (MIRO). Our extensive experiments show that MIRO significantly improves the out-of-distribution performance. Furthermore, our scaling experiments show that the larger the scale of the pre-trained model, the greater the performance improvement of MIRO. Code is available at https://github.com/kakaobrain/miro. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.-
dc.format.extent18-
dc.language영어-
dc.language.isoENG-
dc.publisherSpringer Science and Business Media Deutschland GmbH-
dc.titleDomain Generalization by Mutual-Information Regularization with Pre-trained Models-
dc.typeArticle-
dc.identifier.doi10.1007/978-3-031-20050-2_26-
dc.identifier.bibliographicCitationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v.13683 LNCS, pp 440 - 457-
dc.description.isOpenAccessN-
dc.identifier.scopusid2-s2.0-85142741908-
dc.citation.endPage457-
dc.citation.startPage440-
dc.citation.titleLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)-
dc.citation.volume13683 LNCS-
dc.type.docTypeConference Paper-
dc.publisher.location미국-
dc.description.journalRegisteredClassscopus-
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Software > Department of Artificial Intelligence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Lee, Kyungjae photo

Lee, Kyungjae
소프트웨어대학 (AI학과)
Read more

Altmetrics

Total Views & Downloads

BROWSE