Diagnosis-aware multitask fine-tuning of Whisper for dysarthric speech recognition
- Authors
- Chung, Yoona; Hong, Jeongmin; Lee, Jaehyuk; Kim, Eunchan
- Issue Date
- May-2026
- Publisher
- ELSEVIER
- Keywords
- Dysarthria; Fluency metrics; Voice quality; Pathology fine-tuning; ASR
- Citation
- SPEECH COMMUNICATION, v.180, pp 1 - 17
- Pages
- 17
- Indexed
- SCIE
SCOPUS
- Journal Title
- SPEECH COMMUNICATION
- Volume
- 180
- Start Page
- 1
- End Page
- 17
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/212288
- DOI
- 10.1016/j.specom.2026.103393
- ISSN
- 0167-6393
1872-7182
- Abstract
- AbstractIndividuals with dysarthria exhibit irregular speech patterns that vary by disease, significantly reducing the accuracy of conventional speech-recognition systems. Previous studies have typically focused on a single disease group or used aggregated data without accounting for inter-disease variation, thereby limiting disease-specific insights. In this study, fluency metrics were extracted from a Korean dysarthric speech corpus across three disease groups (stroke, cerebral palsy, and peripheral neuropathy) and the diseases were classified based on these features. The performance of the disease-specific speech-recognition models was evaluated using the weighted character error rate (Weighted-CER). Results showed that classification based on fluency metrics achieved 99% accuracy. The disease-specific models improved the CER by up to 18.34 and 1.05 percentage points compared with the Whisper–Small model and a model trained on the entire dataset, respectively. In terms of Weighted-CER, the error rate decreased by up to 15.27 and 1.49 percentage points, respectively. These findings indicate that disease-specific models can meaningfully enhance speech recognition and underscore the importance of developing speech-recognition systems that can adapt to individual speech characteristics in patients with dysarthria
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 정보시스템학과 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.