Gating Mechanism in Deep Neural Networks for Resource-Efficient Continual Learningopen access
- Authors
- Jin, H.; Yun, K.; Kim, Eun Woo
- Issue Date
- 2022
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Keywords
- Continual learning; Feature extraction; gating mechanism; Interference; Logic gates; Neural networks; Resource management; resource-efficient learning; Task analysis; task interference; Training
- Citation
- IEEE Access, v.10, pp 18776 - 18786
- Pages
- 11
- Journal Title
- IEEE Access
- Volume
- 10
- Start Page
- 18776
- End Page
- 18786
- URI
- https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/61910
- DOI
- 10.1109/ACCESS.2022.3147237
- ISSN
- 2169-3536
- Abstract
- Catastrophic forgetting is well-known tendency in continual learning of a deep neural network to forget previously learned knowledge when optimizing for sequentially incoming tasks. To address the issue, several methods have been proposed in research on continual learning. However, theses methods cannot preserve the previously learned knowledge when training for a new task. Moreover, these methods are susceptible to negative interference between tasks, which may lead to catastrophic forgetting. It even becomes increasingly severe when there exists a notable gap between the domains of tasks. This paper proposes a novel method of controlling gates to select a subset of parameters learned for old tasks, which are then used to efficiently optimize a new task while avoiding negative interference. The proposed approach executes the subset of old parameters that provides positive responses by evaluating the effect when the old and new parameters are used together. The execution or skipping of old parameters through the gates is based on several responses across the network. We evaluate the proposed method in different continual learning scenarios involving image classification datasets. The proposed method outperforms other competitive methods and requires fewer parameters than the state-of-the-art methods during inference by applying the proposed gating mechanism that selectively involves a set of old parameters that provides positive prior knowledge to newer tasks. Additionally, we further prove the effectiveness of the proposed method through various analyses. Author
- Files in This Item
-
- Appears in
Collections - College of Software > School of Computer Science and Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.