Ensemble of deep neural networks using acoustic environment classification for statistical model-based voice activity detection

Hwang, Inyoung; Park, Hyung-Min; Chang, Joon-Hyuk

doi:10.1016/j.csl.2015.11.003

Cited 23 times in Web of Science

Cited 26 times in Scopus

Full metadata record

DC Field	Value	Language
dc.contributor.author	Hwang, Inyoung	-
dc.contributor.author	Park, Hyung-Min	-
dc.contributor.author	Chang, Joon-Hyuk	-
dc.date.accessioned	2021-08-02T16:29:47Z	-
dc.date.available	2021-08-02T16:29:47Z	-
dc.date.issued	2016-07	-
dc.identifier.issn	0885-2308	-
dc.identifier.issn	1095-8363	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/22317	-
dc.description.abstract	In this paper, we investigate an ensemble of deep neural networks (DNNs) that uses an acoustic environment classification (AEC) technique for statistical model-based voice activity detection (VAD). In statistical model-based VAD, the traditional decision rule is based on the geometric mean of the likelihood ratios or on a support vector machine (SVM), both of which are shallow models with zero or one hidden layer. Since shallow models cannot take advantage of the diversity of the feature space distribution, in the training step we build multiple DNNs according to the different noise types by employing the parameters of the statistical model-based VAD algorithm. In addition, a separate DNN is designed for the AEC algorithm in order to choose the best DNN for each noise type. In the on-line noise-aware VAD step, AEC is first performed on a frame-by-frame basis using the separate DNN, which yields the a posteriori probability of each noise type. These probabilities supply the environmental knowledge used to combine the speech presence probabilities derived from the ensemble of DNNs trained for the individual noise types. Our approach to VAD was evaluated in terms of objective measures and showed significant improvement over the conventional algorithm.	-
dc.format.extent	12	-
dc.language	English	-
dc.language.iso	ENG	-
dc.publisher	Academic Press	-
dc.title	Ensemble of deep neural networks using acoustic environment classification for statistical model-based voice activity detection	-
dc.type	Article	-
dc.publisher.location	United Kingdom	-
dc.identifier.doi	10.1016/j.csl.2015.11.003	-
dc.identifier.scopusid	2-s2.0-84951099137	-
dc.identifier.wosid	000371900800001	-
dc.identifier.bibliographicCitation	Computer Speech and Language, v.38, pp 1 - 12	-
dc.citation.title	Computer Speech and Language	-
dc.citation.volume	38	-
dc.citation.startPage	1	-
dc.citation.endPage	12	-
dc.type.docType	Article	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalWebOfScienceCategory	Computer Science, Artificial Intelligence	-
dc.subject.keywordPlus	SPEECH ENHANCEMENT	-
dc.subject.keywordPlus	NOISE	-
dc.subject.keywordAuthor	Voice activity detection	-
dc.subject.keywordAuthor	Statistical model	-
dc.subject.keywordAuthor	Acoustic environment classification	-
dc.subject.keywordAuthor	Deep neural network	-
dc.subject.keywordAuthor	Ensemble	-
dc.identifier.url	https://www.sciencedirect.com/science/article/pii/S0885230815001072?via%3Dihub	-
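
The abstract describes combining per-noise speech presence probabilities using the AEC posteriors, but this record does not give the exact combination rule. The sketch below assumes one plausible reading: a posterior-weighted average followed by a fixed threshold. The function names, the 0.5 threshold, and the sample values are hypothetical illustrations and are not taken from the paper.

```python
from typing import Sequence


def combine_speech_probabilities(
    noise_posteriors: Sequence[float],
    speech_probs: Sequence[float],
) -> float:
    """Posterior-weighted average of per-noise speech presence probabilities.

    noise_posteriors: AEC output for one frame, one value per noise type.
    speech_probs: output of each noise-specific DNN for the same frame.
    (Assumed combination rule; the paper may use a different one.)
    """
    if len(noise_posteriors) != len(speech_probs):
        raise ValueError("need one speech probability per noise type")
    total = sum(noise_posteriors)
    if total <= 0.0:
        raise ValueError("noise posteriors must have a positive sum")
    # Renormalise so the weights sum to one even if the inputs are slightly off.
    weighted = sum(w * p for w, p in zip(noise_posteriors, speech_probs))
    return weighted / total


def detect_voice(
    noise_posteriors: Sequence[float],
    speech_probs: Sequence[float],
    threshold: float = 0.5,  # hypothetical decision threshold
) -> bool:
    """Return True when the combined probability reaches the threshold."""
    combined = combine_speech_probabilities(noise_posteriors, speech_probs)
    return combined >= threshold


# Hypothetical frame with three noise types.
posteriors = [0.7, 0.2, 0.1]   # AEC posteriors
probs = [0.9, 0.4, 0.2]        # per-noise DNN speech presence probabilities
combined = combine_speech_probabilities(posteriors, probs)
is_speech = detect_voice(posteriors, probs)
```

With this weighting, the DNN trained for the noise type that the AEC considers most likely dominates the frame-level decision, while the other DNNs still contribute in proportion to their posteriors.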

Appears in Collections: Seoul College of Engineering > Seoul School of Electronic Engineering > 1. Journal Articles

Related Researcher

Chang, Joon-Hyuk: COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea, +82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
