Cited 0 time in
M-CTRL: A Continual Representation Learning Framework with Slowly Improving Past Pre-Trained Model
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | 최진성 | - |
| dc.contributor.author | 이재홍 | - |
| dc.contributor.author | 이채원 | - |
| dc.contributor.author | Chang, Joon-Hyuk | - |
| dc.date.accessioned | 2024-11-28T11:01:38Z | - |
| dc.date.available | 2024-11-28T11:01:38Z | - |
| dc.date.issued | 2023-06 | - |
| dc.identifier.issn | 0736-7791 | - |
| dc.identifier.issn | 1520-6149 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/196181 | - |
| dc.description.abstract | Representation models pre-trained on unlabeled data show competitive performance in speech recognition, even when fine-tuned on small amounts of labeled data. The continual representation learning (CTRL) framework combines pre-training and continual learning methods to obtain powerful representation. CTRL relies on two neural networks, online and offline models, where the fixed latter model transfers information to the former model with continual learning loss. In this paper, we present momentum continual representation learning (M-CTRL), a framework that slowly updates the offline model with an exponential moving average of the online model. Our framework aims to capture information from the offline model improved on past and new domains. To evaluate our framework, we continually pre-train wav2vec 2.0 with M-CTRL in the following order: Librispeech, Wall Street Journal, and TED-LIUM V3. Our experiments demonstrate that M-CTRL improves the performance in the new domain and reduces information loss in the past domain compared to CTRL. | - |
| dc.format.extent | 5 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
| dc.title | M-CTRL: A Continual Representation Learning Framework with Slowly Improving Past Pre-Trained Model | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.1109/ICASSP49357.2023.10096793 | - |
| dc.identifier.scopusid | 2-s2.0-85180579760 | - |
| dc.identifier.bibliographicCitation | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp 1 - 5 | - |
| dc.citation.title | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings | - |
| dc.citation.startPage | 1 | - |
| dc.citation.endPage | 5 | - |
| dc.type.docType | Conference paper | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.subject.keywordAuthor | continual learning | - |
| dc.subject.keywordAuthor | domain adaptation | - |
| dc.subject.keywordAuthor | representation learning | - |
| dc.subject.keywordAuthor | semi-supervised learning | - |
| dc.subject.keywordAuthor | speech recognition | - |
| dc.identifier.url | https://ieeexplore.ieee.org/document/10096793 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
