Novel three-axis accelerometer-based silent speech interface using deep neural network

Kwon, Jinuk; Nam, Hyerin; Chae, Younsoo; Lee, Seungjae; Kim, In Young; Im, Chang-Hwan

doi:10.1016/j.engappai.2023.105909

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Novel three-axis accelerometer-based silent speech interface using deep neural network

Authors: Kwon, Jinuk; Nam, Hyerin; Chae, Younsoo; Lee, Seungjae; Kim, In Young; Im, Chang-Hwan

Issue Date: Apr-2023

Publisher: PERGAMON-ELSEVIER SCIENCE LTD

Keywords: Silent speech interface (SSI); Three-axis accelerometer; Deep neural network; 1D CNN-bLSTM; Human -computer interface

Citation: ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, v.120, pp.1 - 10

Indexed: SCIE
SCOPUS

Journal Title: ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE

Volume: 120

Start Page: 1

End Page: 10

URI: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/184925

DOI: 10.1016/j.engappai.2023.105909

ISSN: 0952-1976

Abstract: Silent speech interfaces (SSIs) have been developed as new non-acoustic communication channels for people with speech impairment. Various modalities have been employed to implement SSIs, including ultrasound imaging, electromagnetic articulography, and surface electromyography. In this study, for the first time, we examined the feasibility of implementing an SSI using accelerometers, which have been widely used to acquire motion-related information in human activity recognition. Five accelerometers were attached to the facial surface of participants to measure speech-induced facial movements. A deep neural network architecture combining a one-dimensional (1D) convolutional neural network and bidirectional long shortterm memory (1D CNN-bLSTM) was implemented to decode speech-related information contained in the accelerometer signals. In total, 20 healthy individuals participated in the SSI experiments, wherein they were asked to articulate 40 words consisting of 30 Korean words and 10 English Numbers without vocalization. Leave-one-session-out cross-validation was employed to evaluate the classification accuracy of the proposed accelerometer-based SSI. Consequently, an average classification accuracy of 95.58 +/- 1.83% was achieved with only four accelerometers, which is significantly higher than that of the conventional sEMG-based SSI (89.68 +/- 5.27%, p < 0.0005, Wilcoxon signed-rank test). In addition, the proposed SSI achieved an average classification accuracy of 94.65 +/- 2.54% in classifying 40 English words spoken silently. The result demonstrates that accelerometers can be a promising modality to implement SSIs. Considering that accelerometers have multiple advantages over conventional modalities, including non-invasiveness, costeffectiveness, low power consumption, and portability, it is expected that accelerometer-based SSIs would provide a novel means of communication to those who cannot generate speech signals.

Files in This Item: Go to Link

Appears in Collections: 서울 의과대학 > 서울 의공학교실 > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Kim, In Young photo

Kim, In Young: COLLEGE OF MEDICINE (DEPARTMENT OF BIOMEDICAL ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :6,082,178; Today View :4,192

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1365

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE