Auto-VirtualNet: Cost-adaptive dynamic architecture search for multi-task learning
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Kim, E. | - |
dc.contributor.author | Ahn, C. | - |
dc.contributor.author | Oh, S. | - |
dc.date.accessioned | 2021-06-18T07:14:14Z | - |
dc.date.available | 2021-06-18T07:14:14Z | - |
dc.date.issued | 2021-06-28 | - |
dc.identifier.issn | 0925-2312 | - |
dc.identifier.issn | 1872-8286 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/44119 | - |
dc.description.abstract | Multi-task learning (MTL) improves learning efficiency by solving multiple tasks simultaneously compared to multiple instances of individual learning. However, despite its benefits, there still remain several major challenges: first, negative interference can reduce the learning efficiency when the number of tasks is high or the tasks are of limited relevance. Second, exploring an optimal model structure manually is quite restricted. Last but not least, offering cost-adaptive solutions has not been addressed in the MTL regime. In spite of its notable merits, the combined problem has not been well discussed. In this work, we propose a novel MTL approach to address the combinatorial problem while minimizing memory consumption. The proposed method discovers multiple network models dynamically from a pool of candidate models, and produces a set of widely distributed solutions with respect to different computational costs for each task. For the diversity of candidate models, we modularize the given backbone architecture that generates basic building blocks and then construct a hierarchical structure based on the building blocks. The proposed method is trained to optimize both task performance and computational costs of selected models. The proposed method dynamically generates optimal networks for each task and offers significant performance improvements over existing MTL approaches in a range of experiments. © 2021 Elsevier B.V. | - |
dc.format.extent | 9 | - |
dc.language | English | - |
dc.language.iso | ENG | - |
dc.publisher | Elsevier B.V. | - |
dc.title | Auto-VirtualNet: Cost-adaptive dynamic architecture search for multi-task learning | - |
dc.type | Article | - |
dc.identifier.doi | 10.1016/j.neucom.2021.02.050 | - |
dc.identifier.bibliographicCitation | Neurocomputing, v.442, pp 116 - 124 | - |
dc.description.isOpenAccess | N | - |
dc.identifier.wosid | 000646233500010 | - |
dc.identifier.scopusid | 2-s2.0-85102857983 | - |
dc.citation.endPage | 124 | - |
dc.citation.startPage | 116 | - |
dc.citation.title | Neurocomputing | - |
dc.citation.volume | 442 | - |
dc.type.docType | Article | - |
dc.publisher.location | Netherlands | - |
dc.subject.keywordAuthor | Cost-adaptive learning | - |
dc.subject.keywordAuthor | Dynamic architecture search | - |
dc.subject.keywordAuthor | Multi-task learning | - |
dc.subject.keywordAuthor | Task interference | - |
dc.subject.keywordPlus | Linearization | - |
dc.subject.keywordPlus | Memory architecture | - |
dc.subject.keywordPlus | Multi-task learning | - |
dc.subject.keywordPlus | Network architecture | - |
dc.subject.keywordPlus | Structural optimization | - |
dc.subject.keywordPlus | Adaptive dynamics | - |
dc.subject.keywordPlus | Candidate models | - |
dc.subject.keywordPlus | Computational costs | - |
dc.subject.keywordPlus | Cost-adaptive learning | - |
dc.subject.keywordPlus | Dynamic architecture | - |
dc.subject.keywordPlus | Dynamic architecture search | - |
dc.subject.keywordPlus | Learning approach | - |
dc.subject.keywordPlus | Learning efficiency | - |
dc.subject.keywordPlus | Multitask learning | - |
dc.subject.keywordPlus | Task interferences | - |
dc.subject.keywordPlus | Efficiency | - |
dc.subject.keywordPlus | article | - |
dc.subject.keywordPlus | learning | - |
dc.subject.keywordPlus | memory | - |
dc.subject.keywordPlus | task performance | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |