Cited 0 time in
Subject-Independent Silent Speech Classification Using Three-Axis Accelerometers with z-Axis Vector Rotation-Based Data Augmentation
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Jung, Sungmin | - |
| dc.contributor.author | Sohn, Jang Jay | - |
| dc.contributor.author | Kwon, Jinuk | - |
| dc.contributor.author | Im, Chang-Hwan | - |
| dc.date.accessioned | 2026-06-22T06:30:27Z | - |
| dc.date.available | 2026-06-22T06:30:27Z | - |
| dc.date.issued | 2026-03 | - |
| dc.identifier.issn | 2998-4173 | - |
| dc.identifier.issn | 2998-4173 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/214013 | - |
| dc.description.abstract | Silent speech interfaces (SSIs) offer a promising alternative communication method for individuals with speech impairments and in environments where acoustic speech is not feasible. In this study, we propose a subject-independent silent speech recognition system that utilizes facial muscle movements measured by three-axis accelerometers attached to the facial skin. To address inter-individual variability arising from differences in facial anatomy and sensor placement, we introduce spatial normalization and data augmentation methods. First, a z-alignment process aligns the accelerometer z-axis with the direction of gravity, providing a consistent vertical reference across participants. Subsequently, a yaw augmentation process simulates rotational variability of accelerometers in the perpendicular horizontal plane by applying controlled angular perturbations around the z-axis. These techniques eliminate the need for subject-specific calibration while improving model generalizability. The proposed approach was applied to an accelerometer dataset recorded while 20 participants silently spoke 30 Korean words. The results demonstrated substantial performance improvement, with the proposed method achieving an average classification accuracy of 82.93 ± 4.09%, compared with 75.97 ± 6.06% without the proposed approach. Further evaluation on a selected 20-word subset yielded an accuracy of 92.10 ± 3.28%, demonstrating that high-performance subject-independent SSIs can be implemented using the proposed method. | - |
| dc.format.extent | 12 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | - |
| dc.title | Subject-Independent Silent Speech Classification Using Three-Axis Accelerometers with z-Axis Vector Rotation-Based Data Augmentation | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.1109/TASLPRO.2026.3671660 | - |
| dc.identifier.scopusid | 2-s2.0-105032799764 | - |
| dc.identifier.wosid | 001727140100002 | - |
| dc.identifier.bibliographicCitation | IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, v.34, pp 1686 - 1697 | - |
| dc.citation.title | IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | - |
| dc.citation.volume | 34 | - |
| dc.citation.startPage | 1686 | - |
| dc.citation.endPage | 1697 | - |
| dc.type.docType | Article in press | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Acoustics | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalWebOfScienceCategory | Acoustics | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
| dc.subject.keywordPlus | Audio signal processing | - |
| dc.subject.keywordPlus | Classification (of information) | - |
| dc.subject.keywordPlus | Deep learning | - |
| dc.subject.keywordPlus | Face recognition | - |
| dc.subject.keywordPlus | Human computer interaction | - |
| dc.subject.keywordPlus | Human rehabilitation engineering | - |
| dc.subject.keywordPlus | Speech communication | - |
| dc.subject.keywordPlus | Speech recognition | - |
| dc.subject.keywordAuthor | Accelerometers | - |
| dc.subject.keywordAuthor | Accuracy | - |
| dc.subject.keywordAuthor | Deep learning | - |
| dc.subject.keywordAuthor | Speech recognition | - |
| dc.subject.keywordAuthor | Calibration | - |
| dc.subject.keywordAuthor | Artificial intelligence | - |
| dc.subject.keywordAuthor | Vectors | - |
| dc.subject.keywordAuthor | Training | - |
| dc.subject.keywordAuthor | Gravity | - |
| dc.subject.keywordAuthor | Data augmentation | - |
| dc.subject.keywordAuthor | Silent speech recognition | - |
| dc.subject.keywordAuthor | inertial measurement unit (IMU) | - |
| dc.subject.keywordAuthor | deep learning | - |
| dc.subject.keywordAuthor | vector rotation | - |
| dc.subject.keywordAuthor | human-computer interface (HCI) | - |
| dc.identifier.url | https://ieeexplore.ieee.org/document/11424301 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
