Optimally weighted maximum a posteriori probabilities based on minimum classification error for dual-microphone voice activity detection

Huang, Seng Hyun; Chang, Joon-Hyuk

doi:10.1016/j.apacoust.2016.06.025

Cited 1 time in Web of Science

Cited 1 time in Scopus

Full metadata record

DC Field	Value	Language
dc.contributor.author	Huang, Seng Hyun	-
dc.contributor.author	Chang, Joon-Hyuk	-
dc.date.accessioned	2021-08-02T15:53:01Z	-
dc.date.available	2021-08-02T15:53:01Z	-
dc.date.issued	2016-12	-
dc.identifier.issn	0003-682X	-
dc.identifier.issn	1872-910X	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/21326	-
dc.description.abstract	A dual-microphone voice activity detection (VAD) technique is proposed that applies discriminative weight training to optimally weight the spatial features available from two microphones. To exploit the relevant spatial information available from the two microphones, we employ the phase difference, coherence, and power level difference ratio (PLDR) as a feature vector, and use this feature vector to derive maximum a posteriori (MAP) probabilities. We then combine the MAP probabilities using discriminative weight training, i.e., the minimum classification error (MCE) method, to obtain an optimal VAD decision in the spectral domain, which successfully represents the dynamic evolution of speech over time even in non-stationary noise environments. The proposed dual-microphone VAD algorithm outperforms conventional dual-microphone VAD methods based on only a single feature among the PLDR, phase difference, and spectral coherence.	-
dc.format.extent	9	-
dc.language	English	-
dc.language.iso	ENG	-
dc.publisher	Pergamon Press Ltd.	-
dc.title	Optimally weighted maximum a posteriori probabilities based on minimum classification error for dual-microphone voice activity detection	-
dc.type	Article	-
dc.publisher.location	United Kingdom	-
dc.identifier.doi	10.1016/j.apacoust.2016.06.025	-
dc.identifier.scopusid	2-s2.0-84979026590	-
dc.identifier.wosid	000380600400025	-
dc.identifier.bibliographicCitation	Applied Acoustics, v.113, pp 221 - 229	-
dc.citation.title	Applied Acoustics	-
dc.citation.volume	113	-
dc.citation.startPage	221	-
dc.citation.endPage	229	-
dc.type.docType	Article	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Acoustics	-
dc.relation.journalWebOfScienceCategory	Acoustics	-
dc.subject.keywordPlus	ROBUST SPEECH ENHANCEMENT	-
dc.subject.keywordPlus	VECTOR	-
dc.subject.keywordAuthor	Voice activity detection	-
dc.subject.keywordAuthor	Dual-microphone	-
dc.subject.keywordAuthor	Discriminative weight training	-
dc.subject.keywordAuthor	Minimum classification error	-
dc.identifier.url	https://www.sciencedirect.com/science/article/pii/S0003682X16301827?via%3Dihub	-
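
The abstract describes combining per-feature MAP probabilities with weights learned by minimum classification error (MCE) training. The following is a minimal sketch of that general idea only, not the authors' formulation: it assumes each frame already has three speech posteriors (phase difference, coherence, PLDR, in that order), combines them with a weighted sum, and trains the weights by gradient descent on a sigmoid-smoothed error. The function names, the softmax parameterization of the weights, the threshold, and the toy data are all hypothetical.

```python
import math

def combine(posteriors, weights):
    """Weighted sum of the per-feature speech posteriors for one frame."""
    return sum(w * p for w, p in zip(weights, posteriors))

def vad_decision(posteriors, weights, threshold=0.5):
    """Return True (speech) when the combined score exceeds the threshold."""
    return combine(posteriors, weights) > threshold

def softmax(values):
    """Map unconstrained parameters to positive weights that sum to one."""
    m = max(values)
    exps = [math.exp(v - m) for v in values]
    total = sum(exps)
    return [e / total for e in exps]

def train_weights_mce(frames, labels, threshold=0.5, gamma=10.0,
                      lr=0.5, epochs=200):
    """Gradient descent on a sigmoid-smoothed classification error.

    frames: list of per-frame posterior lists; labels: 1 = speech, 0 = noise.
    Assumption: weights are parameterized as softmax(theta) so they stay
    positive and normalized; the source does not state this constraint.
    """
    k = len(frames[0])
    theta = [0.0] * k  # start from equal weights
    for _ in range(epochs):
        w = softmax(theta)
        grad = [0.0] * k
        for p, y in zip(frames, labels):
            score = combine(p, w)
            sign = 1.0 if y == 1 else -1.0
            # Misclassification measure: positive when the frame is misclassified.
            d = -sign * (score - threshold)
            loss = 1.0 / (1.0 + math.exp(-gamma * d))
            dloss_dd = gamma * loss * (1.0 - loss)
            for j in range(k):
                # d(score)/d(theta_j) = w_j * (p_j - score) under softmax weights.
                grad[j] += dloss_dd * (-sign) * w[j] * (p[j] - score)
        theta = [t - lr * g / len(frames) for t, g in zip(theta, grad)]
    return softmax(theta)

# Toy data (hypothetical): the first feature separates speech from noise
# well, the other two are only weakly informative.
frames = [[0.9, 0.5, 0.4], [0.8, 0.3, 0.6], [0.1, 0.6, 0.5], [0.2, 0.4, 0.7]]
labels = [1, 1, 0, 0]
weights = train_weights_mce(frames, labels)
decisions = [vad_decision(p, weights) for p in frames]
```

On this toy data the training shifts weight toward the first feature, since it is the one that agrees with the labels on every frame. In practice the posteriors would come from statistical models of each spatial feature, which this sketch does not cover.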

Appears in Collections: College of Engineering (Seoul) > School of Electronic Engineering (Seoul) > 1. Journal Articles

Related Researcher

Chang, Joon-Hyuk: COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea | +82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.