Detailed Information

Cited 10 time in webofscience Cited 16 time in scopus
Metadata Downloads

Ensemble of jointly trained deep neural network-based acoustic models for reverberant speech recognition

Full metadata record
DC Field Value Language
dc.contributor.authorLee, Moa-
dc.contributor.authorLee, Jeehye-
dc.contributor.authorChang, Joon Hyuk-
dc.date.accessioned2021-07-30T04:56:16Z-
dc.date.available2021-07-30T04:56:16Z-
dc.date.issued2019-02-
dc.identifier.issn1051-2004-
dc.identifier.issn1095-4333-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/2260-
dc.description.abstractDistant speech recognition is a challenge, particularly due to the corruption of speech signals by reverberation caused by large distances between the speaker and microphone. In order to cope with a wide range of reverberations in real-world situations, we present novel approaches for acoustic modeling including an ensemble of deep neural networks (DNNs) and an ensemble of jointly trained DNNs. First, multiple DNNs are firstly designed, each of which copes with a different reverberation time (RT60) in a setup step. Also, each model in the ensemble of DNN acoustic models is further jointly trained, including both feature mapping and acoustic modeling, where feature mapping is designed for dereverberation as a front-end. In a testing phase, ensemble of DNNs are combined by weighted averaging of the prediction probabilities of the RT60 estimates, which is obtained by the convolutional neural network (CNN). In other words, the posterior probability outputs from DNNs are combined using the CNN-based weights as a weighted average. Extensive experiments demonstrate that the proposed approach leads to substantial improvements in speech recognition accuracy over the conventional DNN baseline systems under diverse reverberant conditions. In this paper, experiments are performed on Aurora-4 and CHiME-4 databases.-
dc.format.extent9-
dc.language영어-
dc.language.isoENG-
dc.publisherACADEMIC PRESS INC ELSEVIER SCIENCE-
dc.titleEnsemble of jointly trained deep neural network-based acoustic models for reverberant speech recognition-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1016/j.dsp.2018.11.005-
dc.identifier.scopusid2-s2.0-85057123464-
dc.identifier.wosid000456890300001-
dc.identifier.bibliographicCitationDIGITAL SIGNAL PROCESSING, v.85, pp 1 - 9-
dc.citation.titleDIGITAL SIGNAL PROCESSING-
dc.citation.volume85-
dc.citation.startPage1-
dc.citation.endPage9-
dc.type.docTypeArticle-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClasssci-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.subject.keywordPlusFRONT-END-
dc.subject.keywordPlusALGORITHM-
dc.subject.keywordPlusDEREVERBERATION-
dc.subject.keywordPlusOPTIMIZATION-
dc.subject.keywordPlusENVIRONMENT-
dc.subject.keywordPlusMICROPHONE-
dc.subject.keywordPlusADAPTATION-
dc.subject.keywordPlusREGRESSION-
dc.subject.keywordAuthorReverberant speech recognition-
dc.subject.keywordAuthorDeep neural network-
dc.subject.keywordAuthorJoint training-
dc.subject.keywordAuthorEnsemble acoustic model-
dc.subject.keywordAuthorConvolutional neural network-
dc.identifier.urlhttps://www.sciencedirect.com/science/article/pii/S1051200418308819?via%3Dihub-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE