Cited 1 time in
Optimally weighted maximum a posteriori probabilities based on minimum classification error for dual-microphone voice activity detection
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Huang, Seng Hyun | - |
| dc.contributor.author | Chang, Joon-Hyuk | - |
| dc.date.accessioned | 2021-08-02T15:53:01Z | - |
| dc.date.available | 2021-08-02T15:53:01Z | - |
| dc.date.issued | 2016-12 | - |
| dc.identifier.issn | 0003-682X | - |
| dc.identifier.issn | 1872-910X | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/21326 | - |
| dc.description.abstract | The dual-microphone voice activity detection (VAD) technique is proposed by applying discriminative weight training to achieve optimal weighting of spatial features available within the dual-microphone VAD. Since the motivation behind our method is to use the relevant spatial information available from the two microphones, we employ the phase difference, coherence, and power level difference ratio (PLDR) as a feature vector, and then use this feature vector to derive the maximum a posteriori (MAP) probabilities. Then, we combine each MAP probability based on a discriminative weight training, i.e., the minimum classification error (MCE) method to offer an optimal VAD decision in a spectral domain, which successfully represents the dynamic evolution of speech over time even in the non-stationary noise environments. The proposed dual-microphone VAD algorithm outperforms conventional dual microphone VAD methods based on only single feature among the PLDR, phase difference, and spectral coherence. | - |
| dc.format.extent | 9 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Pergamon Press Ltd. | - |
| dc.title | Optimally weighted maximum a posteriori probabilities based on minimum classification error for dual-microphone voice activity detection | - |
| dc.type | Article | - |
| dc.publisher.location | 영국 | - |
| dc.identifier.doi | 10.1016/j.apacoust.2016.06.025 | - |
| dc.identifier.scopusid | 2-s2.0-84979026590 | - |
| dc.identifier.wosid | 000380600400025 | - |
| dc.identifier.bibliographicCitation | Applied Acoustics, v.113, pp 221 - 229 | - |
| dc.citation.title | Applied Acoustics | - |
| dc.citation.volume | 113 | - |
| dc.citation.startPage | 221 | - |
| dc.citation.endPage | 229 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Acoustics | - |
| dc.relation.journalWebOfScienceCategory | Acoustics | - |
| dc.subject.keywordPlus | ROBUST SPEECH ENHANCEMENT | - |
| dc.subject.keywordPlus | VECTOR | - |
| dc.subject.keywordAuthor | Voice activity detection | - |
| dc.subject.keywordAuthor | Dual-microphone | - |
| dc.subject.keywordAuthor | Discriminative weight training | - |
| dc.subject.keywordAuthor | Minimum classification error | - |
| dc.identifier.url | https://www.sciencedirect.com/science/article/pii/S0003682X16301827?via%3Dihub | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
