Language Model Personalization for Speech Recognition: A Clustered Federated Learning Approach with Adaptive Weight Average

Full metadata record
DC Field | Value | Language
dc.contributor.author | Lee, Chae-Won | -
dc.contributor.author | Lee, Jae-Hong | -
dc.contributor.author | Chang, Joon-Hyuk | -
dc.date.accessioned | 2025-12-24T08:00:23Z | -
dc.date.available | 2025-12-24T08:00:23Z | -
dc.date.issued | 2024-07 | -
dc.identifier.issn | 1070-9908 | -
dc.identifier.issn | 1558-2361 | -
dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/210092 | -
dc.description.abstract | In the rapidly evolving field of automatic speech recognition (ASR), the push towards personalization has become a paramount concern. Text-only personalization, while advantageous for data collection and adaptable to text variations, can suffer from overfitting when using personal data and requires extensive data to mitigate this issue. Federated learning (FL) emerges as a solution, facilitating learning from diverse client models while preserving privacy. However, FL must contend with non-independent and identically distributed (non-i.i.d) data, which can lead to poor performance. We propose two approaches for language model personalization in ASR to address these issues. First, adaptive weighted average addresses the limitations of the uniform weight average used in existing FL methods when combining local language models into a global model. Second, clustered federated learning, based solely on model parameters, improves model stability without relying on information from the local domain. Both strategies aim to enhance personalization and reduce performance degradation, particularly in non-i.i.d scenarios within FL. | -
dc.format.extent | 5 | -
dc.language | English | -
dc.language.iso | ENG | -
dc.publisher | Institute of Electrical and Electronics Engineers | -
dc.title | Language Model Personalization for Speech Recognition: A Clustered Federated Learning Approach with Adaptive Weight Average | -
dc.type | Article | -
dc.publisher.location | United States | -
dc.identifier.doi | 10.1109/LSP.2024.3434467 | -
dc.identifier.scopusid | 2-s2.0-85200234898 | -
dc.identifier.wosid | 001335941600005 | -
dc.identifier.bibliographicCitation | IEEE Signal Processing Letters, v.31, pp 2710 - 2714 | -
dc.citation.title | IEEE Signal Processing Letters | -
dc.citation.volume | 31 | -
dc.citation.startPage | 2710 | -
dc.citation.endPage | 2714 | -
dc.type.docType | Article | -
dc.description.isOpenAccess | N | -
dc.description.journalRegisteredClass | scie | -
dc.description.journalRegisteredClass | scopus | -
dc.relation.journalResearchArea | Engineering | -
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | -
dc.subject.keywordPlus | Computational linguistics | -
dc.subject.keywordPlus | Data privacy | -
dc.subject.keywordAuthor | Automatic speech recognition | -
dc.subject.keywordAuthor | personalization | -
dc.subject.keywordAuthor | language model | -
dc.subject.keywordAuthor | federated learning | -
dc.subject.keywordAuthor | non-i.i.d | -
dc.subject.keywordAuthor | weight average | -
dc.subject.keywordAuthor | clustered federated learning | -
dc.identifier.url | https://ieeexplore.ieee.org/document/10614356 | -
Appears in
Collections
Seoul College of Engineering > Seoul School of Electronic Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)