Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

An experimental study of diffusion-based general speech restoration with predictive-guided conditioning

Full metadata record
DC Field Value Language
dc.contributor.authorYang, Da-Hee-
dc.contributor.authorChang, Joon-Hyuk-
dc.date.accessioned2026-03-23T02:00:28Z-
dc.date.available2026-03-23T02:00:28Z-
dc.date.issued2026-07-
dc.identifier.issn0885-2308-
dc.identifier.issn1095-8363-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/211432-
dc.description.abstractThis study presents a hybrid speech restoration framework that integrates predictive-guided conditioning into a diffusion-based generative model to address complex distortions, including noise, reverberation, and bandwidth reduction. The proposed method employs the outputs of a predictive model to guide the diffusion process, enabling more accurate reconstruction under challenging acoustic conditions. Furthermore, during the final sampling stage, the outputs of the predictive and generative models are fused with a tunable ratio, balancing signal fidelity and perceptual naturalness. Experimental results demonstrate that the proposed approach significantly improves objective restoration metrics compared to conventional diffusion baselines. However, the perceptual quality varies with the fusion ratio, revealing a trade-off between objective gains and subjective preference. These findings highlight the potential of predictive-guided conditioning for robust speech restoration and provide insights into optimizing the balance between predictive and generative contributions.-
dc.format.extent11-
dc.language영어-
dc.language.isoENG-
dc.publisherACADEMIC PRESS LTD- ELSEVIER SCIENCE LTD-
dc.titleAn experimental study of diffusion-based general speech restoration with predictive-guided conditioning-
dc.typeArticle-
dc.publisher.location영국-
dc.identifier.doi10.1016/j.csl.2026.101940-
dc.identifier.scopusid2-s2.0-105027726418-
dc.identifier.wosid001674828400001-
dc.identifier.bibliographicCitationCOMPUTER SPEECH AND LANGUAGE, v.99, pp 1 - 11-
dc.citation.titleCOMPUTER SPEECH AND LANGUAGE-
dc.citation.volume99-
dc.citation.startPage1-
dc.citation.endPage11-
dc.type.docTypeArticle-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.subject.keywordPlusAcoustic noise-
dc.subject.keywordPlusArchitectural acoustics-
dc.subject.keywordPlusDiffusion-
dc.subject.keywordPlusRestoration-
dc.subject.keywordPlusSpeech communication-
dc.subject.keywordPlusSpeech enhancement-
dc.subject.keywordAuthorScore-based diffusion model-
dc.subject.keywordAuthorPredictive-guided conditioning-
dc.subject.keywordAuthorGeneral speech restoration-
dc.subject.keywordAuthorSpeech enhancement-
dc.identifier.urlhttps://www.sciencedirect.com/science/article/pii/S0885230826000045?via%3Dihub-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE