Ensemble of jointly trained deep neural network-based acoustic models for reverberant speech recognition

Lee, Moa; Lee, Jeehye; Chang, Joon Hyuk

doi:10.1016/j.dsp.2018.11.005

Detailed Information

Cited 10 time in webofscience

Cited 16 time in scopus

Metadata Downloads

Ensemble of jointly trained deep neural network-based acoustic models for reverberant speech recognition

Full metadata record

DC Field	Value	Language
dc.contributor.author	Lee, Moa	-
dc.contributor.author	Lee, Jeehye	-
dc.contributor.author	Chang, Joon Hyuk	-
dc.date.accessioned	2021-07-30T04:56:16Z	-
dc.date.available	2021-07-30T04:56:16Z	-
dc.date.issued	2019-02	-
dc.identifier.issn	1051-2004	-
dc.identifier.issn	1095-4333	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/2260	-
dc.description.abstract	Distant speech recognition is a challenge, particularly due to the corruption of speech signals by reverberation caused by large distances between the speaker and microphone. In order to cope with a wide range of reverberations in real-world situations, we present novel approaches for acoustic modeling including an ensemble of deep neural networks (DNNs) and an ensemble of jointly trained DNNs. First, multiple DNNs are firstly designed, each of which copes with a different reverberation time (RT60) in a setup step. Also, each model in the ensemble of DNN acoustic models is further jointly trained, including both feature mapping and acoustic modeling, where feature mapping is designed for dereverberation as a front-end. In a testing phase, ensemble of DNNs are combined by weighted averaging of the prediction probabilities of the RT60 estimates, which is obtained by the convolutional neural network (CNN). In other words, the posterior probability outputs from DNNs are combined using the CNN-based weights as a weighted average. Extensive experiments demonstrate that the proposed approach leads to substantial improvements in speech recognition accuracy over the conventional DNN baseline systems under diverse reverberant conditions. In this paper, experiments are performed on Aurora-4 and CHiME-4 databases.	-
dc.format.extent	9	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	ACADEMIC PRESS INC ELSEVIER SCIENCE	-
dc.title	Ensemble of jointly trained deep neural network-based acoustic models for reverberant speech recognition	-
dc.type	Article	-
dc.publisher.location	미국	-
dc.identifier.doi	10.1016/j.dsp.2018.11.005	-
dc.identifier.scopusid	2-s2.0-85057123464	-
dc.identifier.wosid	000456890300001	-
dc.identifier.bibliographicCitation	DIGITAL SIGNAL PROCESSING, v.85, pp 1 - 9	-
dc.citation.title	DIGITAL SIGNAL PROCESSING	-
dc.citation.volume	85	-
dc.citation.startPage	1	-
dc.citation.endPage	9	-
dc.type.docType	Article	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	sci	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.subject.keywordPlus	FRONT-END	-
dc.subject.keywordPlus	ALGORITHM	-
dc.subject.keywordPlus	DEREVERBERATION	-
dc.subject.keywordPlus	OPTIMIZATION	-
dc.subject.keywordPlus	ENVIRONMENT	-
dc.subject.keywordPlus	MICROPHONE	-
dc.subject.keywordPlus	ADAPTATION	-
dc.subject.keywordPlus	REGRESSION	-
dc.subject.keywordAuthor	Reverberant speech recognition	-
dc.subject.keywordAuthor	Deep neural network	-
dc.subject.keywordAuthor	Joint training	-
dc.subject.keywordAuthor	Ensemble acoustic model	-
dc.subject.keywordAuthor	Convolutional neural network	-
dc.identifier.url	https://www.sciencedirect.com/science/article/pii/S1051200418308819?via%3Dihub	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk: COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE