Cited 0 time in
Diagnosis-aware multitask fine-tuning of Whisper for dysarthric speech recognition
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Chung, Yoona | - |
| dc.contributor.author | Hong, Jeongmin | - |
| dc.contributor.author | Lee, Jaehyuk | - |
| dc.contributor.author | Kim, Eunchan | - |
| dc.date.accessioned | 2026-04-21T06:30:35Z | - |
| dc.date.available | 2026-04-21T06:30:35Z | - |
| dc.date.issued | 2026-05 | - |
| dc.identifier.issn | 0167-6393 | - |
| dc.identifier.issn | 1872-7182 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/212288 | - |
| dc.description.abstract | AbstractIndividuals with dysarthria exhibit irregular speech patterns that vary by disease, significantly reducing the accuracy of conventional speech-recognition systems. Previous studies have typically focused on a single disease group or used aggregated data without accounting for inter-disease variation, thereby limiting disease-specific insights. In this study, fluency metrics were extracted from a Korean dysarthric speech corpus across three disease groups (stroke, cerebral palsy, and peripheral neuropathy) and the diseases were classified based on these features. The performance of the disease-specific speech-recognition models was evaluated using the weighted character error rate (Weighted-CER). Results showed that classification based on fluency metrics achieved 99% accuracy. The disease-specific models improved the CER by up to 18.34 and 1.05 percentage points compared with the Whisper–Small model and a model trained on the entire dataset, respectively. In terms of Weighted-CER, the error rate decreased by up to 15.27 and 1.49 percentage points, respectively. These findings indicate that disease-specific models can meaningfully enhance speech recognition and underscore the importance of developing speech-recognition systems that can adapt to individual speech characteristics in patients with dysarthria | - |
| dc.format.extent | 17 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | ELSEVIER | - |
| dc.title | Diagnosis-aware multitask fine-tuning of Whisper for dysarthric speech recognition | - |
| dc.type | Article | - |
| dc.publisher.location | 네덜란드 | - |
| dc.identifier.doi | 10.1016/j.specom.2026.103393 | - |
| dc.identifier.scopusid | 2-s2.0-105034726630 | - |
| dc.identifier.wosid | 001738286000001 | - |
| dc.identifier.bibliographicCitation | SPEECH COMMUNICATION, v.180, pp 1 - 17 | - |
| dc.citation.title | SPEECH COMMUNICATION | - |
| dc.citation.volume | 180 | - |
| dc.citation.startPage | 1 | - |
| dc.citation.endPage | 17 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Acoustics | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalWebOfScienceCategory | Acoustics | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Interdisciplinary Applications | - |
| dc.subject.keywordPlus | AUTOMATIC SPEECH | - |
| dc.subject.keywordPlus | FUNCTION APPROXIMATION | - |
| dc.subject.keywordPlus | PARAMETERS | - |
| dc.subject.keywordPlus | DISORDERS | - |
| dc.subject.keywordPlus | LANGUAGE | - |
| dc.subject.keywordPlus | CHILDREN | - |
| dc.subject.keywordPlus | MODEL | - |
| dc.subject.keywordAuthor | Dysarthria | - |
| dc.subject.keywordAuthor | Fluency metrics | - |
| dc.subject.keywordAuthor | Voice quality | - |
| dc.subject.keywordAuthor | Pathology fine-tuning | - |
| dc.subject.keywordAuthor | ASR | - |
| dc.identifier.url | https://www.sciencedirect.com/science/article/pii/S0167639326000415?via%3Dihub | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
