Dempster-Shafer theory for enhanced statistical model-based voice activity detection
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Park, Tae-Jun | - |
dc.contributor.author | Chang, Joon Hyuk | - |
dc.date.accessioned | 2021-07-30T05:17:08Z | - |
dc.date.available | 2021-07-30T05:17:08Z | - |
dc.date.created | 2021-05-12 | - |
dc.date.issued | 2018-01 | - |
dc.identifier.issn | 0885-2308 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/3932 | - |
dc.description.abstract | In this paper, we propose to combine the posterior probabilities of voice activity derived from different statistical model-based algorithms for enhanced voice activity detection. For this, the Dempster-Shafer (DS) theory of evidence is employed to represent and combine the different probabilities estimated by three different statistical model-based VAD algorithms including the Sohns likelihood ratio test (LRT)-based method, smoothed LRT-based method, and multiple observation LRT-based method. By considering a generalization of the Bayesian framework and permitting the characterization of uncertainty and ignorance through the DS theory, the probability of an ignorant state is eliminated through the orthogonal sum of several speech presence probabilities, which results in the performance improvement when detecting voice activity. According to objective test results, it is discovered the proposed DS theory-based VAD method offers significant improvements over the conventional approaches. (C) 2017 Elsevier Ltd. All rights reserved. | - |
dc.language | 영어 | - |
dc.language.iso | en | - |
dc.publisher | ACADEMIC PRESS LTD- ELSEVIER SCIENCE LTD | - |
dc.title | Dempster-Shafer theory for enhanced statistical model-based voice activity detection | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Chang, Joon Hyuk | - |
dc.identifier.doi | 10.1016/j.csl.2017.07.001 | - |
dc.identifier.scopusid | 2-s2.0-85025133547 | - |
dc.identifier.wosid | 000411903700004 | - |
dc.identifier.bibliographicCitation | COMPUTER SPEECH AND LANGUAGE, v.47, pp.47 - 58 | - |
dc.relation.isPartOf | COMPUTER SPEECH AND LANGUAGE | - |
dc.citation.title | COMPUTER SPEECH AND LANGUAGE | - |
dc.citation.volume | 47 | - |
dc.citation.startPage | 47 | - |
dc.citation.endPage | 58 | - |
dc.type.rims | ART | - |
dc.type.docType | Article | - |
dc.description.journalClass | 1 | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
dc.subject.keywordPlus | CONDITIONAL MAP CRITERION | - |
dc.subject.keywordAuthor | Dempster-Shafer theory | - |
dc.subject.keywordAuthor | Voice activity detection | - |
dc.subject.keywordAuthor | Likelihood ratio test | - |
dc.identifier.url | https://www.sciencedirect.com/science/article/pii/S0885230816303680?via%3Dihub | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1365
COPYRIGHT © 2021 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.