Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Learning to Learn Task-Adaptive Hyperparameters for Few-Shot Learning

Full metadata record
DC Field Value Language
dc.contributor.authorBaik, Sungyong-
dc.contributor.authorChoi, Myungsub-
dc.contributor.authorChoi, Janghoon-
dc.contributor.authorKim, Heewon-
dc.contributor.authorLee, Kyoung Mu-
dc.date.accessioned2024-04-09T02:00:23Z-
dc.date.available2024-04-09T02:00:23Z-
dc.date.issued2024-03-
dc.identifier.issn0162-8828-
dc.identifier.issn1939-3539-
dc.identifier.urihttps://scholarworks.bwise.kr/ssu/handle/2018.sw.ssu/49454-
dc.description.abstractThe objective of few-shot learning is to design a system that can adapt to a given task with only few examples while achieving generalization. Model-agnostic meta-learning (MAML), which has recently gained the popularity for its simplicity and flexibility, learns a good initialization for fast adaptation to a task under few-data regime. However, its performance has been relatively limited especially when novel tasks are different from tasks previously seen during training. In this work, instead of searching for a better initialization, we focus on designing a better fast adaptation process. Consequently, we propose a new task-adaptive weight update rule that greatly enhances the fast adaptation process. Specifically, we introduce a small meta-network that can generate per-step hyperparameters for each given task: learning rate and weight decay coefficients. The experimental results validate that learning a good weight update rule for fast adaptation is the equally important component that has drawn relatively less attention in the recent few-shot learning approaches. Surprisingly, fast adaptation from random initialization with ALFA can already outperform MAML. Furthermore, the proposed weight-update rule is shown to consistently improve the task-adaptation capability of MAML across diverse problem domains: few-shot classification, cross-domain few-shot classification, regression, visual tracking, and video frame interpolation.-
dc.format.extent14-
dc.language영어-
dc.language.isoENG-
dc.publisherIEEE COMPUTER SOC-
dc.titleLearning to Learn Task-Adaptive Hyperparameters for Few-Shot Learning-
dc.typeArticle-
dc.identifier.doi10.1109/TPAMI.2023.3261387-
dc.identifier.bibliographicCitationIEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, v.46, no.3, pp 1441 - 1454-
dc.identifier.wosid001174191100023-
dc.identifier.scopusid2-s2.0-85151570545-
dc.citation.endPage1454-
dc.citation.number3-
dc.citation.startPage1441-
dc.citation.titleIEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE-
dc.citation.volume46-
dc.identifier.urlhttps://ieeexplore.ieee.org/document/10080995-
dc.publisher.location미국-
dc.type.docTypeArticle-
dc.description.isOpenAccessN-
dc.subject.keywordAuthorTask analysis-
dc.subject.keywordAuthorOptimization-
dc.subject.keywordAuthorMathematical models-
dc.subject.keywordAuthorAdaptation models-
dc.subject.keywordAuthorVisualization-
dc.subject.keywordAuthorTraining-
dc.subject.keywordAuthorNeural networks-
dc.subject.keywordAuthorFew-shot learning-
dc.subject.keywordAuthorMAML-
dc.subject.keywordAuthormeta-learning-
dc.subject.keywordAuthorvideo frame interpolation-
dc.subject.keywordAuthorvisual tracking-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
Files in This Item
Go to Link
Appears in
Collections
ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Heewon photo

Kim, Heewon
College of Information Technology (Global School of Media)
Read more

Altmetrics

Total Views & Downloads

BROWSE