Augmented Latent Features of Deep Neural Network-Based Automatic Speech Recognition for Motor-Driven Robotsopen access
- Authors
- Lee, Moa; Chang, Joon-Hyuk
- Issue Date
- Jul-2020
- Publisher
- MDPI
- Keywords
- automatic speech recognition; human-robot interaction; deep learning; bottleneck layer; latent feature; bottleneck network
- Citation
- APPLIED SCIENCES-BASEL, v.10, no.13, pp.1 - 10
- Indexed
- SCIE
SCOPUS
- Journal Title
- APPLIED SCIENCES-BASEL
- Volume
- 10
- Number
- 13
- Start Page
- 1
- End Page
- 10
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/9680
- DOI
- 10.3390/app10134602
- ISSN
- 2076-3417
- Abstract
- Speech recognition for intelligent robots seems to suffer from performance degradation due to ego-noise. The ego-noise is caused by the motors, fans, and mechanical parts inside the intelligent robots especially when the robot moves or shakes its body. To overcome the problems caused by the ego-noise, we propose a robust speech recognition algorithm that uses motor-state information of the robot as an auxiliary feature. For this, we use two deep neural networks (DNN) in this paper. Firstly, we design the latent features using a bottleneck layer, one of the internal layers having a smaller number of hidden units relative to the other layers, to represent whether the motor is operating or not. The latent features maximizing the representation of the motor-state information are generated by taking the motor data and acoustic features as the input of the first DNN. Secondly, once the motor-state dependent latent features are designed at the first DNN, the second DNN, accounting for acoustic modeling, receives the latent features as the input along with the acoustic features. We evaluated the proposed system on LibriSpeech database. The proposed network enables efficient compression of the acoustic and motor-state information, and the resulting word error rate (WER) are superior to that of a conventional speech recognition system.
- Files in This Item
-
- Appears in
Collections - 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles
![qrcode](https://api.qrserver.com/v1/create-qr-code/?size=55x55&data=https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/9680)
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.