Cited 16 time in
Ensemble of jointly trained deep neural network-based acoustic models for reverberant speech recognition
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Lee, Moa | - |
| dc.contributor.author | Lee, Jeehye | - |
| dc.contributor.author | Chang, Joon Hyuk | - |
| dc.date.accessioned | 2021-07-30T04:56:16Z | - |
| dc.date.available | 2021-07-30T04:56:16Z | - |
| dc.date.issued | 2019-02 | - |
| dc.identifier.issn | 1051-2004 | - |
| dc.identifier.issn | 1095-4333 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/2260 | - |
| dc.description.abstract | Distant speech recognition is a challenge, particularly due to the corruption of speech signals by reverberation caused by large distances between the speaker and microphone. In order to cope with a wide range of reverberations in real-world situations, we present novel approaches for acoustic modeling including an ensemble of deep neural networks (DNNs) and an ensemble of jointly trained DNNs. First, multiple DNNs are firstly designed, each of which copes with a different reverberation time (RT60) in a setup step. Also, each model in the ensemble of DNN acoustic models is further jointly trained, including both feature mapping and acoustic modeling, where feature mapping is designed for dereverberation as a front-end. In a testing phase, ensemble of DNNs are combined by weighted averaging of the prediction probabilities of the RT60 estimates, which is obtained by the convolutional neural network (CNN). In other words, the posterior probability outputs from DNNs are combined using the CNN-based weights as a weighted average. Extensive experiments demonstrate that the proposed approach leads to substantial improvements in speech recognition accuracy over the conventional DNN baseline systems under diverse reverberant conditions. In this paper, experiments are performed on Aurora-4 and CHiME-4 databases. | - |
| dc.format.extent | 9 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | ACADEMIC PRESS INC ELSEVIER SCIENCE | - |
| dc.title | Ensemble of jointly trained deep neural network-based acoustic models for reverberant speech recognition | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.1016/j.dsp.2018.11.005 | - |
| dc.identifier.scopusid | 2-s2.0-85057123464 | - |
| dc.identifier.wosid | 000456890300001 | - |
| dc.identifier.bibliographicCitation | DIGITAL SIGNAL PROCESSING, v.85, pp 1 - 9 | - |
| dc.citation.title | DIGITAL SIGNAL PROCESSING | - |
| dc.citation.volume | 85 | - |
| dc.citation.startPage | 1 | - |
| dc.citation.endPage | 9 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | sci | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
| dc.subject.keywordPlus | FRONT-END | - |
| dc.subject.keywordPlus | ALGORITHM | - |
| dc.subject.keywordPlus | DEREVERBERATION | - |
| dc.subject.keywordPlus | OPTIMIZATION | - |
| dc.subject.keywordPlus | ENVIRONMENT | - |
| dc.subject.keywordPlus | MICROPHONE | - |
| dc.subject.keywordPlus | ADAPTATION | - |
| dc.subject.keywordPlus | REGRESSION | - |
| dc.subject.keywordAuthor | Reverberant speech recognition | - |
| dc.subject.keywordAuthor | Deep neural network | - |
| dc.subject.keywordAuthor | Joint training | - |
| dc.subject.keywordAuthor | Ensemble acoustic model | - |
| dc.subject.keywordAuthor | Convolutional neural network | - |
| dc.identifier.url | https://www.sciencedirect.com/science/article/pii/S1051200418308819?via%3Dihub | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
