Performance improvement in speech recognition using multimodal features
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Kim, M.W. | - |
dc.contributor.author | Song, W.M. | - |
dc.contributor.author | Kim, Y.J. | - |
dc.contributor.author | Kim, E.J. | - |
dc.date.available | 2019-04-10T11:35:07Z | - |
dc.date.created | 2018-04-17 | - |
dc.date.issued | 2007 | - |
dc.identifier.isbn | 0769528759 | - |
dc.identifier.uri | http://scholarworks.bwise.kr/ssu/handle/2018.sw.ssu/33809 | - |
dc.description.abstract | In this paper, we propose a neural-network-based model for robust speech recognition that integrates audio, visual, and contextual information. The Bimodal Neural Network (BMNN) is a four-layer multi-layer perceptron that combines the audio and visual features of speech to compensate for the loss of audio information caused by noise. To further improve recognition accuracy in noisy environments, we also propose a post-processing step based on contextual information, namely the sequential patterns of words spoken by a user. Our experimental results show that the model outperforms every single-mode model. In particular, when contextual information is used, we obtain over 90% recognition accuracy even in noisy environments, a significant improvement over the state of the art in speech recognition. © 2007 IEEE. | - |
dc.relation.isPartOf | Proceedings - Third International Conference on Natural Computation, ICNC 2007 | - |
dc.title | Performance improvement in speech recognition using multimodal features | - |
dc.type | Conference | - |
dc.identifier.doi | 10.1109/ICNC.2007.550 | - |
dc.type.rims | CONF | - |
dc.identifier.bibliographicCitation | 3rd International Conference on Natural Computation, ICNC 2007, v.2, pp.686 - 690 | - |
dc.description.journalClass | 2 | - |
dc.identifier.scopusid | 2-s2.0-38049036416 | - |
dc.citation.conferenceDate | 2007-08-24 | - |
dc.citation.conferencePlace | Haikou, Hainan | - |
dc.citation.endPage | 690 | - |
dc.citation.startPage | 686 | - |
dc.citation.title | 3rd International Conference on Natural Computation, ICNC 2007 | - |
dc.citation.volume | 2 | - |
dc.contributor.affiliatedAuthor | Kim, M.W. | - |
dc.type.docType | Conference Paper | - |
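
The abstract describes two components: a four-layer perceptron that fuses audio and visual features, and a post-processing step that uses the sequence of previously spoken words. The following is a minimal sketch of that idea, not the authors' implementation. It assumes that "four layers" means an input layer, two hidden layers, and an output layer, that the two modalities are fused by simple concatenation, and that the contextual step can be approximated by reweighting the network output with a bigram table. All names, layer sizes, and the bigram values are hypothetical, and the weights are random rather than trained.

```python
import math
import random


def init_layer(n_in, n_out, rng):
    """Return (weights, biases) for one dense layer with small random weights."""
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def dense(x, layer, activation):
    """Apply one dense layer followed by an element-wise activation."""
    weights, biases = layer
    return [activation(sum(w * v for w, v in zip(row, x)) + b)
            for row, b in zip(weights, biases)]


def softmax(z):
    """Numerically stable softmax over a list of scores."""
    m = max(z)
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]


def bmnn_forward(audio, visual, layers):
    """Concatenate both modalities, pass through the hidden layers, return word posteriors."""
    h = audio + visual  # assumption: fusion by concatenation at the input layer
    for layer in layers[:-1]:
        h = dense(h, layer, math.tanh)
    logits = dense(h, layers[-1], lambda v: v)
    return softmax(logits)


def rescore_with_context(probs, prev_word, bigram, weight=1.0):
    """Reweight posteriors by P(word | prev_word) ** weight and renormalise."""
    scores = [p * (bigram[prev_word][w] ** weight) for w, p in enumerate(probs)]
    total = sum(scores)
    return [s / total for s in scores]


rng = random.Random(0)
N_AUDIO, N_VISUAL, N_HIDDEN, N_WORDS = 6, 4, 5, 3  # hypothetical sizes

# Three weight matrices connect four layers: input, two hidden, output.
layers = [
    init_layer(N_AUDIO + N_VISUAL, N_HIDDEN, rng),
    init_layer(N_HIDDEN, N_HIDDEN, rng),
    init_layer(N_HIDDEN, N_WORDS, rng),
]

audio = [rng.random() for _ in range(N_AUDIO)]    # stand-in for audio features
visual = [rng.random() for _ in range(N_VISUAL)]  # stand-in for lip/visual features
probs = bmnn_forward(audio, visual, layers)

# Hypothetical bigram table: row = previous word, column = candidate next word.
bigram = [[0.8, 0.1, 0.1], [0.1, 0.8, 0.1], [0.1, 0.1, 0.8]]
rescored = rescore_with_context(probs, prev_word=2, bigram=bigram)
```

In this sketch the contextual step only shifts probability mass toward words that the bigram table considers likely after the previous word. The paper's actual post-processing over sequential word patterns may differ.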