Diff-PLC: A Diffusion-Based Approach For Effective Packet Loss Concealment

Yang, Da-Hee; Chang, Joon-Hyuk

doi:10.1109/SLT61566.2024.10832225

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Diff-PLC: A Diffusion-Based Approach For Effective Packet Loss Concealment

Full metadata record

DC Field	Value	Language
dc.contributor.author	Yang, Da-Hee	-
dc.contributor.author	Chang, Joon-Hyuk	-
dc.date.accessioned	2025-03-11T02:00:14Z	-
dc.date.available	2025-03-11T02:00:14Z	-
dc.date.issued	2025-01	-
dc.identifier.issn	2639-5479	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/206732	-
dc.description.abstract	We introduce diffusion-based packet loss concealment (DiffPLC), a novel approach designed to improve speech quality in the presence of packet losses for speech transmission. Derived from the foundation of a diffusion-based neural vocoder, the Diff-PLC introduces a crucial modification and supplementary concepts for the reconstruction of lost packets. A key aspect of the Diff-PLC involves integrating a feature-wise linear modulation layer into the diffusion model, facilitating the seamless incorporation of a conditioning feature. Furthermore, the Diff-PLC leverages packet loss embedding as an additional conditioning feature which significantly assists the diffusion model in restoring lost packets. The proposed model is evaluated using the blind test set of the INTERSPEECH 2022 PLC challenge, demonstrating the considerable restoration capabilities of Diff-PLC across various reference-free and reference-based metrics, including PLCMOS, PESQ, STOI, and NISQA.	-
dc.format.extent	7	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.title	Diff-PLC: A Diffusion-Based Approach For Effective Packet Loss Concealment	-
dc.type	Article	-
dc.identifier.doi	10.1109/SLT61566.2024.10832225	-
dc.identifier.scopusid	2-s2.0-85217377790	-
dc.identifier.wosid	001440556800048	-
dc.identifier.bibliographicCitation	Proceedings of 2024 IEEE Spoken Language Technology Workshop, SLT 2024, pp 357 - 363	-
dc.citation.title	Proceedings of 2024 IEEE Spoken Language Technology Workshop, SLT 2024	-
dc.citation.startPage	357	-
dc.citation.endPage	363	-
dc.type.docType	Proceedings Paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Acoustics	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Linguistics	-
dc.relation.journalWebOfScienceCategory	Acoustics	-
dc.relation.journalWebOfScienceCategory	Computer Science, Artificial Intelligence	-
dc.relation.journalWebOfScienceCategory	Computer Science, Interdisciplinary Applications	-
dc.relation.journalWebOfScienceCategory	Linguistics	-
dc.subject.keywordAuthor	Diffusion probabilistic model	-
dc.subject.keywordAuthor	FiLM conditioning	-
dc.subject.keywordAuthor	packet loss concealment	-

Files in This Item: There are no files associated with this item.

Appears in Collections: 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk: COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE