Guided conditioning with predictive network on score-based diffusion model for speech enhancement

Kim, Dail; Yang, Da-Hee; Kim, Donghyun; Chang, Joon-Hyuk; Yang, Jaemo; Choi, Jeonghwan; Lee, Moa; Moon, Han-gil

doi:10.21437/Interspeech.2024-1545


Authors: Kim, Dail; Yang, Da-Hee; Kim, Donghyun; Chang, Joon-Hyuk; Yang, Jaemo; Choi, Jeonghwan; Lee, Moa; Moon, Han-gil

Issue Date: Sep-2024

Keywords: Speech enhancement; score-based diffusion models; generative modeling; predictive modeling; conditioning

Citation: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp 1190 - 1194

Pages: 5

Indexed: SCOPUS

Journal Title: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Start Page: 1190

End Page: 1194

URI: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/207030

DOI: 10.21437/Interspeech.2024-1545

ISSN: 1990-9772; 2308-457X

Abstract: Diffusion-based speech enhancement (SE) models have emerged, but they remove noise less effectively than predictive SE models. This reflects a trade-off between generative models, which can produce more natural speech based on an estimated target distribution, and predictive models, which are more effective at noise removal. To mitigate this trade-off, we propose a novel conditioning method for score-based diffusion models. The proposed approach guides the diffusion model with a pretrained predictive model, without joint training, so that the enhanced speech from the predictive model steers the diffusion model in the proper direction. The proposed method outperforms the baseline method while using only half the number of sampling steps.
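The abstract does not say how the predictive model's output enters the diffusion model, so the following is only a toy sketch of the general idea, not the authors' method. It assumes the estimate from a frozen predictive enhancer is passed to the score function as an extra conditioning input during reverse sampling. Every name here (`predictive_estimate`, `toy_score`, `guided_sample`, the `weight` parameter) is hypothetical; the moving-average smoother and the closed-form Gaussian score are stand-ins for trained networks, and the loop is a simplified annealed Langevin sampler rather than the sampler used in the paper.

```python
import numpy as np


def predictive_estimate(noisy, width=5):
    """Hypothetical stand-in for a frozen, pretrained predictive enhancer.

    A moving-average smoother; NOT the predictive network from the paper.
    """
    kernel = np.ones(width) / width
    return np.convolve(noisy, kernel, mode="same")


def toy_score(x_t, noisy, guide, sigma, weight=0.8):
    """Hypothetical stand-in for a trained score network s(x_t, y, guide, t).

    Returns the exact score of N(mean, sigma^2 I), where the mean blends the
    noisy input with the predictive estimate (an assumed form of guidance).
    """
    mean = weight * guide + (1.0 - weight) * noisy
    return -(x_t - mean) / sigma**2


def guided_sample(noisy, num_steps=30, sigma_max=0.5, sigma_min=0.01, seed=0):
    """Simplified annealed Langevin loop conditioned on the predictive estimate."""
    rng = np.random.default_rng(seed)
    # The predictor runs once and is never updated (no joint training).
    guide = predictive_estimate(noisy)
    # Start from the noisy signal perturbed at the largest noise level.
    x = noisy + sigma_max * rng.standard_normal(noisy.shape)
    for sigma in np.geomspace(sigma_max, sigma_min, num_steps):
        step = 0.5 * sigma**2
        # Drift along the guided score, then inject fresh Gaussian noise.
        x = x + step * toy_score(x, noisy, guide, sigma)
        x = x + np.sqrt(2.0 * step) * rng.standard_normal(x.shape)
    return x


# Toy usage: a sine wave corrupted by white Gaussian noise.
rng = np.random.default_rng(1)
t = np.linspace(0.0, 1.0, 256)
clean = np.sin(2.0 * np.pi * 3.0 * t)
noisy = clean + 0.3 * rng.standard_normal(t.shape)
enhanced = guided_sample(noisy)
```

In this toy setting the sample settles near the blend of the noisy input and the predictive estimate, which only illustrates how an external estimate can steer the reverse process. It says nothing about the quality or step-count results reported in the paper.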


Related Researcher


Chang, Joon-Hyuk: COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
