Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Diagnosis-aware multitask fine-tuning of Whisper for dysarthric speech recognition

Authors
Chung, YoonaHong, JeongminLee, JaehyukKim, Eunchan
Issue Date
May-2026
Publisher
ELSEVIER
Keywords
Dysarthria; Fluency metrics; Voice quality; Pathology fine-tuning; ASR
Citation
SPEECH COMMUNICATION, v.180, pp 1 - 17
Pages
17
Indexed
SCIE
SCOPUS
Journal Title
SPEECH COMMUNICATION
Volume
180
Start Page
1
End Page
17
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/212288
DOI
10.1016/j.specom.2026.103393
ISSN
0167-6393
1872-7182
Abstract
AbstractIndividuals with dysarthria exhibit irregular speech patterns that vary by disease, significantly reducing the accuracy of conventional speech-recognition systems. Previous studies have typically focused on a single disease group or used aggregated data without accounting for inter-disease variation, thereby limiting disease-specific insights. In this study, fluency metrics were extracted from a Korean dysarthric speech corpus across three disease groups (stroke, cerebral palsy, and peripheral neuropathy) and the diseases were classified based on these features. The performance of the disease-specific speech-recognition models was evaluated using the weighted character error rate (Weighted-CER). Results showed that classification based on fluency metrics achieved 99% accuracy. The disease-specific models improved the CER by up to 18.34 and 1.05 percentage points compared with the Whisper–Small model and a model trained on the entire dataset, respectively. In terms of Weighted-CER, the error rate decreased by up to 15.27 and 1.49 percentage points, respectively. These findings indicate that disease-specific models can meaningfully enhance speech recognition and underscore the importance of developing speech-recognition systems that can adapt to individual speech characteristics in patients with dysarthria
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 정보시스템학과 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Eunchan photo

Kim, Eunchan
COLLEGE OF ENGINEERING (DEPARTMENT OF INFORMATION SYSTEMS)
Read more

Altmetrics

Total Views & Downloads

BROWSE