Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Mitigating Overfitting in Structured Pruning of ASR Models with Gradient-Guided Parameter Regularization

Full metadata record
DC Field Value Language
dc.contributor.authorKim, Dong-Hyun-
dc.contributor.authorChang, Joon-Hyuk-
dc.date.accessioned2025-02-13T02:30:19Z-
dc.date.available2025-02-13T02:30:19Z-
dc.date.issued2024-09-
dc.identifier.issn1990-9772-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/206481-
dc.description.abstractRecent advancements in automatic speech recognition such as Wav2vec 2.0 and Whisper, confront deployment challenges due to their substantial model parameters. Model compression through joint distillation and structured pruning emerges as an effective solution but still faces overfitting and catastrophic forgetting, exacerbated by domain shifts or limited data availability. To address this issue, we propose the gradient-guided parameter regularization method aimed at maintaining the model's generality. Our approach employs gradient values to detect overfit-prone parameters in the student model and subsequently regularize these parameters to align closely with their counterparts in the teacher model. Through extensive experiments, we demonstrate the efficacy of our approach in reducing overfitting and enhancing performance, especially in scenarios characterized by domain shifts and limited data availability.-
dc.format.extent5-
dc.language영어-
dc.language.isoENG-
dc.titleMitigating Overfitting in Structured Pruning of ASR Models with Gradient-Guided Parameter Regularization-
dc.typeArticle-
dc.identifier.doi10.21437/Interspeech.2024-976-
dc.identifier.scopusid2-s2.0-85214833183-
dc.identifier.wosid001331850104120-
dc.identifier.bibliographicCitationProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp 4493 - 4497-
dc.citation.titleProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH-
dc.citation.startPage4493-
dc.citation.endPage4497-
dc.type.docTypeProceedings Paper-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.subject.keywordPlusParameterization-
dc.subject.keywordAuthorautomatic speech recognition-
dc.subject.keywordAuthormodel compression-
dc.subject.keywordAuthorstructured pruning-
dc.identifier.urlhttps://www.isca-archive.org/interspeech_2024/kim24k_interspeech.html-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE