Mitigating Overfitting in Structured Pruning of ASR Models with Gradient-Guided Parameter Regularization

Kim, Dong-Hyun; Chang, Joon-Hyuk

doi:10.21437/Interspeech.2024-976

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Mitigating Overfitting in Structured Pruning of ASR Models with Gradient-Guided Parameter Regularization

Full metadata record

DC Field	Value	Language
dc.contributor.author	Kim, Dong-Hyun	-
dc.contributor.author	Chang, Joon-Hyuk	-
dc.date.accessioned	2025-02-13T02:30:19Z	-
dc.date.available	2025-02-13T02:30:19Z	-
dc.date.issued	2024-09	-
dc.identifier.issn	1990-9772	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/206481	-
dc.description.abstract	Recent advancements in automatic speech recognition such as Wav2vec 2.0 and Whisper, confront deployment challenges due to their substantial model parameters. Model compression through joint distillation and structured pruning emerges as an effective solution but still faces overfitting and catastrophic forgetting, exacerbated by domain shifts or limited data availability. To address this issue, we propose the gradient-guided parameter regularization method aimed at maintaining the model's generality. Our approach employs gradient values to detect overfit-prone parameters in the student model and subsequently regularize these parameters to align closely with their counterparts in the teacher model. Through extensive experiments, we demonstrate the efficacy of our approach in reducing overfitting and enhancing performance, especially in scenarios characterized by domain shifts and limited data availability.	-
dc.format.extent	5	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.title	Mitigating Overfitting in Structured Pruning of ASR Models with Gradient-Guided Parameter Regularization	-
dc.type	Article	-
dc.identifier.doi	10.21437/Interspeech.2024-976	-
dc.identifier.scopusid	2-s2.0-85214833183	-
dc.identifier.wosid	001331850104120	-
dc.identifier.bibliographicCitation	Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp 4493 - 4497	-
dc.citation.title	Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH	-
dc.citation.startPage	4493	-
dc.citation.endPage	4497	-
dc.type.docType	Proceedings Paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalWebOfScienceCategory	Computer Science, Artificial Intelligence	-
dc.subject.keywordPlus	Parameterization	-
dc.subject.keywordAuthor	automatic speech recognition	-
dc.subject.keywordAuthor	model compression	-
dc.subject.keywordAuthor	structured pruning	-
dc.identifier.url	https://www.isca-archive.org/interspeech_2024/kim24k_interspeech.html	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk: COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE