Cited 0 time in
Mitigating Overfitting in Structured Pruning of ASR Models with Gradient-Guided Parameter Regularization
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Kim, Dong-Hyun | - |
| dc.contributor.author | Chang, Joon-Hyuk | - |
| dc.date.accessioned | 2025-02-13T02:30:19Z | - |
| dc.date.available | 2025-02-13T02:30:19Z | - |
| dc.date.issued | 2024-09 | - |
| dc.identifier.issn | 1990-9772 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/206481 | - |
| dc.description.abstract | Recent advancements in automatic speech recognition such as Wav2vec 2.0 and Whisper, confront deployment challenges due to their substantial model parameters. Model compression through joint distillation and structured pruning emerges as an effective solution but still faces overfitting and catastrophic forgetting, exacerbated by domain shifts or limited data availability. To address this issue, we propose the gradient-guided parameter regularization method aimed at maintaining the model's generality. Our approach employs gradient values to detect overfit-prone parameters in the student model and subsequently regularize these parameters to align closely with their counterparts in the teacher model. Through extensive experiments, we demonstrate the efficacy of our approach in reducing overfitting and enhancing performance, especially in scenarios characterized by domain shifts and limited data availability. | - |
| dc.format.extent | 5 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.title | Mitigating Overfitting in Structured Pruning of ASR Models with Gradient-Guided Parameter Regularization | - |
| dc.type | Article | - |
| dc.identifier.doi | 10.21437/Interspeech.2024-976 | - |
| dc.identifier.scopusid | 2-s2.0-85214833183 | - |
| dc.identifier.wosid | 001331850104120 | - |
| dc.identifier.bibliographicCitation | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp 4493 - 4497 | - |
| dc.citation.title | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH | - |
| dc.citation.startPage | 4493 | - |
| dc.citation.endPage | 4497 | - |
| dc.type.docType | Proceedings Paper | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
| dc.subject.keywordPlus | Parameterization | - |
| dc.subject.keywordAuthor | automatic speech recognition | - |
| dc.subject.keywordAuthor | model compression | - |
| dc.subject.keywordAuthor | structured pruning | - |
| dc.identifier.url | https://www.isca-archive.org/interspeech_2024/kim24k_interspeech.html | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
