Detailed Information

Cited 23 time in webofscience Cited 26 time in scopus
Metadata Downloads

Ensemble of deep neural networks using acoustic environment classification for statistical model-based voice activity detection

Authors
Hwang, InyoungPark, Hyung-MinChang, Joon-Hyuk
Issue Date
Jul-2016
Publisher
ACADEMIC PRESS LTD- ELSEVIER SCIENCE LTD
Keywords
Voice activity detection; Statistical model; Acoustic environment classification; Deep neural network; Ensemble
Citation
COMPUTER SPEECH AND LANGUAGE, v.38, pp.1 - 12
Indexed
SCIE
SCOPUS
Journal Title
COMPUTER SPEECH AND LANGUAGE
Volume
38
Start Page
1
End Page
12
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/22317
DOI
10.1016/j.csl.2015.11.003
ISSN
0885-2308
Abstract
In this paper, we investigate the ensemble of deep neural networks (DNNs) by using an acoustic environment classification (AEC) technique for the statistical model-based voice activity detection (VAD). From an investigation of the statistical model-based VAD, it is known that the traditional decision rule is based on the geometric mean of the likelihood ratio or the support vector machine (SVM), which is a shallow model with zero or one hidden layer. Since the shallow models cannot take an advantage of the diversity of the space distribution of features, in the training step, we basically build the multiple DNNs according the different noise types by employing the parameters of the statistical model-based VAD algorithm. In addition, the separate DNN is designed for the AEC algorithm in order to choose the best DNN for each noise. In the on-line noise-aware VAD step, the AEC is first performed on a frame-by-frame basis using the separate DNN so the a posteriori probabilities to identify noise are obtained. Once the probabilities are achieved for each noise, the environmental knowledge is contributed to allow us to combine the speech presence probabilities which are derived from the ensemble of the DNNs trained for the individual noise. Our approach for VAD was evaluated in terms of objective measures and showed significant improvement compared to the conventional algorithm.
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE