VACE-WPE: Virtual Acoustic Channel Expansion Based on Neural Networks for Weighted Prediction Error-Based Speech Dereverberation
- Authors
- Yang, Joon-Young; Chang, Joon-Hyuk
- Issue Date
- 2022
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Keywords
- Speech processingReverberationMicrophonesNeural networksFeature extractionSpeech recognitionPrediction algorithmsDeep neural networkoffline processingsingle microphonespeech dereverberationweighted prediction error
- Citation
- IEEE/ACM Transactions on Audio Speech and Language Processing, v.30, pp.174 - 189
- Indexed
- SCIE
SCOPUS
- Journal Title
- IEEE/ACM Transactions on Audio Speech and Language Processing
- Volume
- 30
- Start Page
- 174
- End Page
- 189
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/139965
- DOI
- 10.1109/TASLP.2021.3133190
- ISSN
- 2329-9290
- Abstract
- Speech dereverberation is an important issue for many real-world speech processing applications. Among the techniques developed, the weighted prediction error (WPE) algorithm has been widely adopted and advanced over the last decade, which blindly cancels out the late reverberation component from the reverberant mixture of microphone signals. In this study, we extend the neural-network-based virtual acoustic channel expansion (VACE) framework for the WPE-based speech dereverberation, a variant of the WPE that we recently proposed to enable the use of dual-channel WPE algorithm in a single-microphone speech dereverberation scenario. Based on the previous study, some ablation studies are conducted regarding the constituents of the VACE-WPE in an offline processing scenario. These studies reveal the characteristics of the system, thereby simplifying the architecture and leading to the introduction of new strategies for training the neural network for the VACE. Experimental results demonstrate that VACE-WPE considerably outperforms its single-channel counterpart in simulated noisy reverberant environments in terms of objective speech quality and is superior to the single-channel WPE as well as several fully neural speech dereverberation methods when employed as the front-end for the far-field automatic speech recognizer.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.