Subject-Independent Silent Speech Classification Using Three-Axis Accelerometers with z-Axis Vector Rotation-Based Data Augmentation

Jung, Sungmin; Sohn, Jang Jay; Kwon, Jinuk; Im, Chang-Hwan

doi:10.1109/TASLPRO.2026.3671660

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Subject-Independent Silent Speech Classification Using Three-Axis Accelerometers with z-Axis Vector Rotation-Based Data Augmentation

Authors: Jung, Sungmin; Sohn, Jang Jay; Kwon, Jinuk; Im, Chang-Hwan

Issue Date: Mar-2026

Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords: Accelerometers; Accuracy; Deep learning; Speech recognition; Calibration; Artificial intelligence; Vectors; Training; Gravity; Data augmentation; Silent speech recognition; inertial measurement unit (IMU); deep learning; vector rotation; human-computer interface (HCI)

Citation: IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, v.34, pp 1686 - 1697

Pages: 12

Indexed: SCIE
SCOPUS

Journal Title: IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING

Volume: 34

Start Page: 1686

End Page: 1697

URI: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/214013

DOI: 10.1109/TASLPRO.2026.3671660

ISSN: 2998-4173
2998-4173

Abstract: Silent speech interfaces (SSIs) offer a promising alternative communication method for individuals with speech impairments and in environments where acoustic speech is not feasible. In this study, we propose a subject-independent silent speech recognition system that utilizes facial muscle movements measured by three-axis accelerometers attached to the facial skin. To address inter-individual variability arising from differences in facial anatomy and sensor placement, we introduce spatial normalization and data augmentation methods. First, a z-alignment process aligns the accelerometer z-axis with the direction of gravity, providing a consistent vertical reference across participants. Subsequently, a yaw augmentation process simulates rotational variability of accelerometers in the perpendicular horizontal plane by applying controlled angular perturbations around the z-axis. These techniques eliminate the need for subject-specific calibration while improving model generalizability. The proposed approach was applied to an accelerometer dataset recorded while 20 participants silently spoke 30 Korean words. The results demonstrated substantial performance improvement, with the proposed method achieving an average classification accuracy of 82.93 ± 4.09%, compared with 75.97 ± 6.06% without the proposed approach. Further evaluation on a selected 20-word subset yielded an accuracy of 92.10 ± 3.28%, demonstrating that high-performance subject-independent SSIs can be implemented using the proposed method.

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > ETC > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Im, Chang Hwan photo

Im, Chang Hwan: COLLEGE OF ENGINEERING (서울 바이오메디컬공학전공)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE