Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Performance improvement in speech recognition using multimodal features

Full metadata record
DC Field Value Language
dc.contributor.authorKim, M.W.-
dc.contributor.authorSong, W.M.-
dc.contributor.authorKim, Y.J.-
dc.contributor.authorKim, E.J.-
dc.date.available2019-04-10T11:35:07Z-
dc.date.created2018-04-17-
dc.date.issued2007-
dc.identifier.isbn0769528759-
dc.identifier.urihttp://scholarworks.bwise.kr/ssu/handle/2018.sw.ssu/33809-
dc.description.abstractIn this paper, we propose a neural network based model of robust speech recognition by integrating audio, visual, and contextual information. Bimodal Neural Network(BMNN) is a multi-layer perceptron of 4 layers, which combines audio and visual features of speech to compensate loss of audio information caused by noise. In order to improve the accuracy of speech recognition in noisy environments, we also propose a post-processing based on contextual information which are sequential patterns of words spoken by a user. Our experimental results show that our model outperforms any single mode models. Particularly, when we use the contextual information, we can obtain over 90% recognition accuracy even in noisy environments, which is a significant improvement compared with the state of art in speech recognition. © 2007 IEEE.-
dc.relation.isPartOfProceedings - Third International Conference on Natural Computation, ICNC 2007-
dc.titlePerformance improvement in speech recognition using multimodal features-
dc.typeConference-
dc.identifier.doi10.1109/ICNC.2007.550-
dc.type.rimsCONF-
dc.identifier.bibliographicCitation3rd International Conference on Natural Computation, ICNC 2007, v.2, pp.686 - 690-
dc.description.journalClass2-
dc.identifier.scopusid2-s2.0-38049036416-
dc.citation.conferenceDate2007-08-24-
dc.citation.conferencePlaceHaikou, Hainan-
dc.citation.endPage690-
dc.citation.startPage686-
dc.citation.title3rd International Conference on Natural Computation, ICNC 2007-
dc.citation.volume2-
dc.contributor.affiliatedAuthorKim, M.W.-
dc.type.docTypeConference Paper-
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Information Technology > School of Computer Science and Engineering > 2. Conference Papers

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE