Language Model Personalization for Speech Recognition: A Clustered Federated Learning Approach with Adaptive Weight Average
- Authors
- Lee, Chae-Won; Lee, Jae-Hong; Chang, Joon-Hyuk
- Issue Date
- Jul-2024
- Publisher
- Institute of Electrical and Electronics Engineers
- Keywords
- Automatic speech recognition; personalization; language model; federated learning; non-i.i.d; weight average; clustered federated learning
- Citation
- IEEE Signal Processing Letters, v.31, pp 2710 - 2714
- Pages
- 5
- Indexed
- SCIE
SCOPUS
- Journal Title
- IEEE Signal Processing Letters
- Volume
- 31
- Start Page
- 2710
- End Page
- 2714
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/210092
- DOI
- 10.1109/LSP.2024.3434467
- ISSN
- 1070-9908
1558-2361
- Abstract
- In the rapidly evolving field of automatic speech recognition (ASR), the push towards personalization has become a paramount concern. Text-only personalization, while advantageous for data collection and adaptable to text variations, can suffer from overfitting when using personal data and requires extensive data to mitigate this issue. Federated learning (FL) emerges as a solution, facilitating learning from diverse client models while preserving privacy. However, FL addresses the challenges posed by non independent and identically distributed (non-i.i.d) data, potentially leading to poor performance. We propose two approaches for language model personalization in ASR to address these issues. First, adaptive weighted average addresses the limitations of uniform weight average in the existing FL method by combining local language models into a global model. Second, clustered federated learning, based solely on model parameters, improves model stability without relying on information from the local domain. Both strategies aim to enhance personalization and reduce performance degradation, particularly in non-i.i.d scenarios within the FL.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.