Statistical Model-Based Voice Activity Detection Based on Second-Order Conditional MAP with Soft Decision
- Authors
- Chang, Joon-Hyuk
- Issue Date
- Apr-2012
- Publisher
- Electronics and Telecommunications Research Institute (ETRI)
- Keywords
- Voice activity detection; second-order conditional MAP; soft decision; likelihood ratio test
- Citation
- ETRI Journal, v.34, no.2, pp 184 - 189
- Pages
- 6
- Indexed
- SCI; SCIE; SCOPUS; KCI
- Journal Title
- ETRI Journal
- Volume
- 34
- Number
- 2
- Start Page
- 184
- End Page
- 189
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/27560
- DOI
- 10.4218/etrij.12.0111.0344
- ISSN
- 1225-6463; 2233-7326
- Abstract
- In this paper, we propose a novel approach to statistical model-based voice activity detection (VAD) that incorporates a second-order conditional maximum a posteriori (CMAP) criterion. As a technical improvement on the first-order CMAP criterion in [1], we consider both the current observation and the voice activity decisions in the previous two frames to take fuller account of the interframe correlation of voice activity. This differs from the previous approach [1] in that we condition on the voice activity decisions in the previous two frames (second order) rather than the previous single frame (first order), which yields four thresholds and an additional degree of freedom. A soft-decision scheme is also incorporated, resulting in time-varying thresholds for further performance improvement. Experimental results show that the proposed algorithm outperforms the conventional CMAP-based VAD technique under various experimental conditions.
- Appears in Collections
- Seoul College of Engineering > Seoul Division of Electronic Engineering > 1. Journal Articles
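
The decision rule summarized in the abstract can be illustrated with a minimal sketch. It assumes a per-frame log-likelihood ratio is already available and that the threshold is selected by the decisions in the previous two frames, giving four thresholds. The threshold values, the function names `hard_cmap2_vad` and `soft_threshold`, and in particular the soft-decision weighting (a mixture of the four thresholds weighted by hypothetical speech presence probabilities of the two previous frames) are illustrative assumptions, not the formulas from the paper.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical thresholds indexed by (decision at n-2, decision at n-1).
# The values are placeholders chosen for illustration, not taken from the paper.
THRESHOLDS: Dict[Tuple[int, int], float] = {
    (0, 0): 2.0,  # two noise frames: require strong evidence for speech
    (0, 1): 1.0,
    (1, 0): 1.2,
    (1, 1): 0.5,  # two speech frames: accept weaker evidence
}


def hard_cmap2_vad(
    log_lr: Sequence[float],
    thresholds: Dict[Tuple[int, int], float] = THRESHOLDS,
) -> List[int]:
    """Return 1 (speech) or 0 (noise) per frame, with the threshold
    selected by the decisions made in the previous two frames."""
    prev2, prev1 = 0, 0  # assumption: noise-only before the first frame
    decisions: List[int] = []
    for llr in log_lr:
        decision = 1 if llr > thresholds[(prev2, prev1)] else 0
        decisions.append(decision)
        prev2, prev1 = prev1, decision
    return decisions


def soft_threshold(
    p_prev2: float,
    p_prev1: float,
    thresholds: Dict[Tuple[int, int], float] = THRESHOLDS,
) -> float:
    """Assumed soft-decision variant: blend the four thresholds using the
    speech presence probabilities of the two previous frames, which makes
    the effective threshold vary over time."""
    total = 0.0
    for (d2, d1), eta in thresholds.items():
        w2 = p_prev2 if d2 else 1.0 - p_prev2
        w1 = p_prev1 if d1 else 1.0 - p_prev1
        total += w2 * w1 * eta
    return total


if __name__ == "__main__":
    print(hard_cmap2_vad([0.0, 2.5, 1.1, 0.8, 0.3, 0.3]))  # [0, 1, 1, 1, 0, 0]
    print(soft_threshold(0.5, 0.5))
```

In this sketch, a frame with a log-likelihood ratio of 0.8 is classified as speech only when the two preceding frames were speech, which shows how conditioning on past decisions exploits interframe correlation.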
