Subject-Independent Silent Speech Classification Using Three-Axis Accelerometers with z-Axis Vector Rotation-Based Data Augmentation

Jung, Sungmin; Sohn, Jang Jay; Kwon, Jinuk; Im, Chang-Hwan

doi:10.1109/TASLPRO.2026.3671660

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Subject-Independent Silent Speech Classification Using Three-Axis Accelerometers with z-Axis Vector Rotation-Based Data Augmentation

Full metadata record

DC Field	Value	Language
dc.contributor.author	Jung, Sungmin	-
dc.contributor.author	Sohn, Jang Jay	-
dc.contributor.author	Kwon, Jinuk	-
dc.contributor.author	Im, Chang-Hwan	-
dc.date.accessioned	2026-06-22T06:30:27Z	-
dc.date.available	2026-06-22T06:30:27Z	-
dc.date.issued	2026-03	-
dc.identifier.issn	2998-4173	-
dc.identifier.issn	2998-4173	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/214013	-
dc.description.abstract	Silent speech interfaces (SSIs) offer a promising alternative communication method for individuals with speech impairments and in environments where acoustic speech is not feasible. In this study, we propose a subject-independent silent speech recognition system that utilizes facial muscle movements measured by three-axis accelerometers attached to the facial skin. To address inter-individual variability arising from differences in facial anatomy and sensor placement, we introduce spatial normalization and data augmentation methods. First, a z-alignment process aligns the accelerometer z-axis with the direction of gravity, providing a consistent vertical reference across participants. Subsequently, a yaw augmentation process simulates rotational variability of accelerometers in the perpendicular horizontal plane by applying controlled angular perturbations around the z-axis. These techniques eliminate the need for subject-specific calibration while improving model generalizability. The proposed approach was applied to an accelerometer dataset recorded while 20 participants silently spoke 30 Korean words. The results demonstrated substantial performance improvement, with the proposed method achieving an average classification accuracy of 82.93 ± 4.09%, compared with 75.97 ± 6.06% without the proposed approach. Further evaluation on a selected 20-word subset yielded an accuracy of 92.10 ± 3.28%, demonstrating that high-performance subject-independent SSIs can be implemented using the proposed method.	-
dc.format.extent	12	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC	-
dc.title	Subject-Independent Silent Speech Classification Using Three-Axis Accelerometers with z-Axis Vector Rotation-Based Data Augmentation	-
dc.type	Article	-
dc.publisher.location	미국	-
dc.identifier.doi	10.1109/TASLPRO.2026.3671660	-
dc.identifier.scopusid	2-s2.0-105032799764	-
dc.identifier.wosid	001727140100002	-
dc.identifier.bibliographicCitation	IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, v.34, pp 1686 - 1697	-
dc.citation.title	IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING	-
dc.citation.volume	34	-
dc.citation.startPage	1686	-
dc.citation.endPage	1697	-
dc.type.docType	Article in press	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Acoustics	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalWebOfScienceCategory	Acoustics	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.subject.keywordPlus	Audio signal processing	-
dc.subject.keywordPlus	Classification (of information)	-
dc.subject.keywordPlus	Deep learning	-
dc.subject.keywordPlus	Face recognition	-
dc.subject.keywordPlus	Human computer interaction	-
dc.subject.keywordPlus	Human rehabilitation engineering	-
dc.subject.keywordPlus	Speech communication	-
dc.subject.keywordPlus	Speech recognition	-
dc.subject.keywordAuthor	Accelerometers	-
dc.subject.keywordAuthor	Accuracy	-
dc.subject.keywordAuthor	Deep learning	-
dc.subject.keywordAuthor	Speech recognition	-
dc.subject.keywordAuthor	Calibration	-
dc.subject.keywordAuthor	Artificial intelligence	-
dc.subject.keywordAuthor	Vectors	-
dc.subject.keywordAuthor	Training	-
dc.subject.keywordAuthor	Gravity	-
dc.subject.keywordAuthor	Data augmentation	-
dc.subject.keywordAuthor	Silent speech recognition	-
dc.subject.keywordAuthor	inertial measurement unit (IMU)	-
dc.subject.keywordAuthor	deep learning	-
dc.subject.keywordAuthor	vector rotation	-
dc.subject.keywordAuthor	human-computer interface (HCI)	-
dc.identifier.url	https://ieeexplore.ieee.org/document/11424301	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > ETC > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Im, Chang Hwan photo

Im, Chang Hwan: COLLEGE OF ENGINEERING (서울 바이오메디컬공학전공)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE