Virtual acoustic channel expansion based on neural networks for weighted prediction error-based speech dereverberation
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Yang, Joon-Young | - |
dc.contributor.author | Chang, Joon Hyuk | - |
dc.date.accessioned | 2021-07-30T05:13:24Z | - |
dc.date.available | 2021-07-30T05:13:24Z | - |
dc.date.created | 2021-05-11 | - |
dc.date.issued | 2020-10 | - |
dc.identifier.issn | 2308-457X | - |
dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/3662 | - |
dc.description.abstract | In this study, we propose a neural-network-based virtual acoustic channel expansion (VACE) framework for weighted prediction error (WPE)-based speech dereverberation. Specifically, for the situation in which only a single microphone observation is available, we aim to build a neural network capable of generating a virtual signal that can be exploited as the secondary input for the dual-channel WPE algorithm, thus making its dereverberation performance superior to the single-channel WPE. To implement the VACE-WPE, the neural network for the VACE is initialized and integrated to the pre-trained neural WPE algorithm. The entire system is then trained in a supervised manner to output a dereverberated signal that is close to the oracle early arriving speech. Experimental results show that the proposed VACE-WPE method outperforms the single-channel WPE in a real room impulse response shortening task. | - |
dc.language | 영어 | - |
dc.language.iso | en | - |
dc.publisher | International Speech Communication Association | - |
dc.title | Virtual acoustic channel expansion based on neural networks for weighted prediction error-based speech dereverberation | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Chang, Joon Hyuk | - |
dc.identifier.doi | 10.21437/Interspeech.2020-1553 | - |
dc.identifier.scopusid | 2-s2.0-85098130955 | - |
dc.identifier.bibliographicCitation | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v.2020, no.October, pp.3930 - 3934 | - |
dc.relation.isPartOf | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH | - |
dc.citation.title | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH | - |
dc.citation.volume | 2020 | - |
dc.citation.number | October | - |
dc.citation.startPage | 3930 | - |
dc.citation.endPage | 3934 | - |
dc.type.rims | ART | - |
dc.type.docType | Conference Paper | - |
dc.description.journalClass | 1 | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scopus | - |
dc.subject.keywordPlus | Impulse response | - |
dc.subject.keywordPlus | Speech communication | - |
dc.subject.keywordPlus | Dereverberation | - |
dc.subject.keywordPlus | Entire system | - |
dc.subject.keywordPlus | Room impulse response | - |
dc.subject.keywordPlus | Single channels | - |
dc.subject.keywordPlus | Speech dereverberation | - |
dc.subject.keywordPlus | Virtual acoustics | - |
dc.subject.keywordPlus | Virtual signals | - |
dc.subject.keywordPlus | Weighted predictions | - |
dc.subject.keywordPlus | Neural networks | - |
dc.subject.keywordAuthor | Multi-channel linear prediction | - |
dc.subject.keywordAuthor | Neural network | - |
dc.subject.keywordAuthor | Speech dereverberation | - |
dc.subject.keywordAuthor | Weighted prediction error | - |
dc.identifier.url | https://www.isca-speech.org/archive/interspeech_2020/yang20j_interspeech.html | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1365
COPYRIGHT © 2021 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.