Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

VACE-WPE: Virtual Acoustic Channel Expansion Based on Neural Networks for Weighted Prediction Error-Based Speech Dereverberation

Authors
Yang, Joon-YoungChang, Joon-Hyuk
Issue Date
2022
Publisher
Institute of Electrical and Electronics Engineers Inc.
Keywords
Speech processingReverberationMicrophonesNeural networksFeature extractionSpeech recognitionPrediction algorithmsDeep neural networkoffline processingsingle microphonespeech dereverberationweighted prediction error
Citation
IEEE/ACM Transactions on Audio Speech and Language Processing, v.30, pp.174 - 189
Indexed
SCIE
SCOPUS
Journal Title
IEEE/ACM Transactions on Audio Speech and Language Processing
Volume
30
Start Page
174
End Page
189
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/139965
DOI
10.1109/TASLP.2021.3133190
ISSN
2329-9290
Abstract
Speech dereverberation is an important issue for many real-world speech processing applications. Among the techniques developed, the weighted prediction error (WPE) algorithm has been widely adopted and advanced over the last decade, which blindly cancels out the late reverberation component from the reverberant mixture of microphone signals. In this study, we extend the neural-network-based virtual acoustic channel expansion (VACE) framework for the WPE-based speech dereverberation, a variant of the WPE that we recently proposed to enable the use of dual-channel WPE algorithm in a single-microphone speech dereverberation scenario. Based on the previous study, some ablation studies are conducted regarding the constituents of the VACE-WPE in an offline processing scenario. These studies reveal the characteristics of the system, thereby simplifying the architecture and leading to the introduction of new strategies for training the neural network for the VACE. Experimental results demonstrate that VACE-WPE considerably outperforms its single-channel counterpart in simulated noisy reverberant environments in terms of objective speech quality and is superior to the single-channel WPE as well as several fully neural speech dereverberation methods when employed as the front-end for the far-field automatic speech recognizer.
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE