On using acoustic environment classification for statistical model-based speech enhancement
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Choi, Jae-Hun | - |
| dc.contributor.author | Chang, Joon-Hyuk | - |
| dc.date.accessioned | 2021-08-02T19:30:04Z | - |
| dc.date.available | 2021-08-02T19:30:04Z | - |
| dc.date.issued | 2012-03 | - |
| dc.identifier.issn | 0167-6393 | - |
| dc.identifier.issn | 1872-7182 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/27582 | - |
| dc.description.abstract | In this paper, we present a statistical model-based speech enhancement technique using acoustic environment classification supported by a Gaussian mixture model (GMM). In the data training stage, the principal parameters of the statistical model-based speech enhancement algorithm, namely the weighting parameter in the decision-directed (DD) method, the long-term smoothing parameter of the noise estimation, and the control parameter of the minimum gain value, are each set to an optimal operating point according to the given noise information to ensure the best performance for each noise type. These optimal operating points, which are specific to the different background noises, are estimated using composite measures, the objective quality measures that correlate most strongly with the actual quality of speech processed by noise suppression algorithms. In the on-line environment-aware speech enhancement step, noise classification is performed on a frame-by-frame basis using the maximum likelihood (ML)-based GMM. The speech absence probability (SAP) is used to detect speech absence periods and to update the likelihood of the GMM. According to the classified noise information for each frame, we assign the optimal values to the aforementioned three parameters for speech enhancement. We evaluated the performance of the proposed method using objective speech quality measures and subjective listening tests under various noise environments. Our experimental results showed that the proposed method outperforms a conventional algorithm with fixed parameters. © 2011 Elsevier B.V. All rights reserved. | - |
| dc.format.extent | 14 | - |
| dc.language | English | - |
| dc.language.iso | ENG | - |
| dc.publisher | Elsevier BV | - |
| dc.title | On using acoustic environment classification for statistical model-based speech enhancement | - |
| dc.type | Article | - |
| dc.publisher.location | Netherlands | - |
| dc.identifier.doi | 10.1016/j.specom.2011.10.009 | - |
| dc.identifier.scopusid | 2-s2.0-84155164746 | - |
| dc.identifier.wosid | 000300809100012 | - |
| dc.identifier.bibliographicCitation | Speech Communication, v.54, no.3, pp 477 - 490 | - |
| dc.citation.title | Speech Communication | - |
| dc.citation.volume | 54 | - |
| dc.citation.number | 3 | - |
| dc.citation.startPage | 477 | - |
| dc.citation.endPage | 490 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | sci | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Acoustics | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalWebOfScienceCategory | Acoustics | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Interdisciplinary Applications | - |
| dc.subject.keywordPlus | NOISE | - |
| dc.subject.keywordPlus | SUPPRESSION | - |
| dc.subject.keywordAuthor | Speech enhancement | - |
| dc.subject.keywordAuthor | Noise classification | - |
| dc.subject.keywordAuthor | Gaussian mixture model | - |
| dc.subject.keywordAuthor | DFT | - |
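
The on-line step described in the abstract (frame-by-frame ML classification with a GMM per noise type, likelihood updates gated by the SAP, then a lookup of the three enhancement parameters) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the class name `NoiseClassifier`, the recursive smoothing of the log-likelihood, the SAP threshold, the feature dimension, and every numeric parameter value are assumptions made for the example.

```python
import numpy as np


def diag_gmm_loglik(x, weights, means, variances):
    """Log-likelihood of one feature vector under a diagonal-covariance GMM."""
    diff = x - means  # shape (M, D): one row per mixture component
    log_comp = (
        np.log(weights)
        - 0.5 * np.sum(np.log(2.0 * np.pi * variances), axis=1)
        - 0.5 * np.sum(diff ** 2 / variances, axis=1)
    )
    m = log_comp.max()
    return m + np.log(np.exp(log_comp - m).sum())  # log-sum-exp for stability


class NoiseClassifier:
    """Hypothetical SAP-gated ML noise classifier with per-noise parameter lookup."""

    def __init__(self, gmms, params, sap_threshold=0.9, forget=0.9):
        self.gmms = gmms                    # noise name -> (weights, means, variances)
        self.params = params                # noise name -> enhancement parameters
        self.sap_threshold = sap_threshold  # assumed threshold, not from the paper
        self.forget = forget                # assumed smoothing factor, not from the paper
        self.scores = {name: 0.0 for name in gmms}
        self.current = next(iter(gmms))     # arbitrary initial class

    def update(self, feature, sap):
        # Update the likelihoods only when speech is judged absent, so the
        # decision reflects the background noise rather than the talker.
        if sap >= self.sap_threshold:
            for name, (w, mu, var) in self.gmms.items():
                ll = diag_gmm_loglik(feature, w, mu, var)
                self.scores[name] = (
                    self.forget * self.scores[name] + (1.0 - self.forget) * ll
                )
            # Maximum-likelihood decision over the smoothed scores.
            self.current = max(self.scores, key=self.scores.get)
        return self.params[self.current]


# Toy single-component models and made-up operating points (illustrative only).
gmms = {
    "white": (np.array([1.0]), np.array([[0.0, 0.0]]), np.array([[1.0, 1.0]])),
    "babble": (np.array([1.0]), np.array([[5.0, 5.0]]), np.array([[1.0, 1.0]])),
}
params = {
    "white": {"alpha_dd": 0.98, "noise_smoothing": 0.95, "gain_floor": 0.10},
    "babble": {"alpha_dd": 0.92, "noise_smoothing": 0.90, "gain_floor": 0.20},
}

clf = NoiseClassifier(gmms, params)
noise_only = clf.update(np.array([5.1, 4.9]), sap=0.95)   # speech absent: scores updated
with_speech = clf.update(np.array([0.0, 0.0]), sap=0.10)  # speech present: decision held
```

In this sketch the second call keeps the previous decision because the frame is judged to contain speech, which mirrors the role the abstract gives to the SAP. How the paper actually accumulates the likelihood over frames is not stated in this record.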
