Detailed Information

CLASS: CONTINUAL LEARNING APPROACH FOR SPEECH SUPER-RESOLUTION

Authors
Kim, Donghyun; Kim, Yungyeo; Chang, Joon-Hyuk
Issue Date
Apr-2024
Publisher
Institute of Electrical and Electronics Engineers Inc.
Keywords
Bandwidth extension; continual learning; self-supervised learning; speech super-resolution
Citation
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp 1401 - 1405
Pages
5
Indexed
SCOPUS
Journal Title
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Start Page
1401
End Page
1405
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/197477
DOI
10.1109/ICASSP48485.2024.10445917
ISSN
0736-7791
1520-6149
Abstract
Supervised deep learning has significantly improved bandwidth extension (BWE), and the emergence of self-supervised learning (SSL) has prompted the combined exploration of SSL and BWE. Although SSL-based deep learning models have been shown to produce better representations than their supervised counterparts when trained naively, their effectiveness diminishes when the model learns different tasks sequentially. To address this problem, we propose a continual learning framework called CLASS, which incorporates continual learning (CL) and self-supervised pretraining (SSP) to improve BWE performance. The framework integrates SSP and BWE fine-tuning tasks with CL approaches, enabling the model to retain its representation knowledge while adapting to BWE as the target task. We employ either a CL fine-tuning loss or an exponential moving average (EMA) algorithm to update model parameters gradually, so that the model learns to reconstruct wideband signals from narrowband signals without losing information from the previous task. In addition, we present a new continual loss based on an extended version of elastic weight consolidation (EWC) that updates the Fisher information matrix for better BWE performance. Our experimental results demonstrate that the proposed method outperforms the baseline approach on the TIMIT dataset. Furthermore, we explore the impact of different hyperparameter settings, contributing to a more comprehensive understanding of the performance of the proposed framework.
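The two continual-learning ingredients named in the abstract, an exponential moving average of parameters and an elastic weight consolidation penalty, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the dictionary-of-lists parameter layout, the default `decay`, `lam`, and `momentum` values, and in particular the running-average rule in `update_fisher` are assumptions made for illustration only. The abstract does not specify how the paper updates the Fisher information matrix.

```python
from typing import Dict, List

# Hypothetical parameter container: layer name -> flat list of weights.
Params = Dict[str, List[float]]


def ema_update(ema: Params, current: Params, decay: float = 0.99) -> Params:
    """Blend current weights into the averaged copy:
    ema <- decay * ema + (1 - decay) * current."""
    return {
        name: [decay * e + (1.0 - decay) * c
               for e, c in zip(ema[name], current[name])]
        for name in ema
    }


def ewc_penalty(current: Params, anchor: Params, fisher: Params,
                lam: float = 1.0) -> float:
    """Standard diagonal EWC penalty:
    (lam / 2) * sum_i F_i * (theta_i - theta_star_i) ** 2."""
    total = 0.0
    for name in current:
        for theta, theta_star, f in zip(current[name], anchor[name], fisher[name]):
            total += f * (theta - theta_star) ** 2
    return 0.5 * lam * total


def update_fisher(fisher: Params, grads: Params, momentum: float = 0.9) -> Params:
    """Assumed update rule (not taken from the paper): running average of
    squared gradients as a diagonal Fisher estimate."""
    return {
        name: [momentum * f + (1.0 - momentum) * g * g
               for f, g in zip(fisher[name], grads[name])]
        for name in fisher
    }
```

In a fine-tuning loop of this shape, the pretrained weights would serve as `anchor`, the penalty would be added to the BWE loss, and a larger `lam` or `decay` would keep the model closer to its pretrained representation.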
Appears in
Collections
College of Engineering (Seoul) > School of Electronic Engineering (Seoul) > 1. Journal Articles

Related Researcher

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)