Speech enhancement using heterogeneous information
- Authors
- Xiong, Yan; Xu, Fang; Chen, Qiang; Zhang, Jun
- Issue Date
- Mar-2020
- Publisher
- IGI Global
- Keywords
- Heterogeneous Information; Model-Based; Multi-Stream; Speech Enhancement; Throat Microphone
- Citation
- Cognitive Analytics: Concepts, Methodologies, Tools, and Applications, pp 1060 - 1074
- Pages
- 15
- Indexed
- SCOPUS
- Journal Title
- Cognitive Analytics: Concepts, Methodologies, Tools, and Applications
- Start Page
- 1060
- End Page
- 1074
- URI
- https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/115719
- DOI
- 10.4018/978-1-7998-2460-2.ch054
- Abstract
- This article describes how to use heterogeneous information in speech enhancement. In most current speech enhancement systems, clean speech is recovered only from signals collected by acoustic microphones, which are strongly affected by acoustic noise. Heterogeneous information from different kinds of sensors, usually called multi-stream information, is seldom used in speech enhancement because the speech waveform cannot be recovered from the signals that many kinds of sensors provide. In this article, the authors propose a new model-based multi-stream speech enhancement framework that can use the heterogeneous information in signals from different kinds of sensors, even when some of those signals are not directly related to the speech waveform. Based on this framework, a new speech enhancement scheme that uses acoustic and throat microphone recordings is also proposed. Experimental results show that the proposed scheme outperforms several single-stream speech enhancement methods in different noisy environments. © 2020, IGI Global.
- Appears in Collections
- COLLEGE OF ENGINEERING SCIENCES > SCHOOL OF ELECTRICAL ENGINEERING > 1. Journal Articles