Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Diff-PLC: A Diffusion-Based Approach For Effective Packet Loss Concealment

Full metadata record
DC Field Value Language
dc.contributor.authorYang, Da-Hee-
dc.contributor.authorChang, Joon-Hyuk-
dc.date.accessioned2025-03-11T02:00:14Z-
dc.date.available2025-03-11T02:00:14Z-
dc.date.issued2025-01-
dc.identifier.issn2639-5479-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/206732-
dc.description.abstractWe introduce diffusion-based packet loss concealment (DiffPLC), a novel approach designed to improve speech quality in the presence of packet losses for speech transmission. Derived from the foundation of a diffusion-based neural vocoder, the Diff-PLC introduces a crucial modification and supplementary concepts for the reconstruction of lost packets. A key aspect of the Diff-PLC involves integrating a feature-wise linear modulation layer into the diffusion model, facilitating the seamless incorporation of a conditioning feature. Furthermore, the Diff-PLC leverages packet loss embedding as an additional conditioning feature which significantly assists the diffusion model in restoring lost packets. The proposed model is evaluated using the blind test set of the INTERSPEECH 2022 PLC challenge, demonstrating the considerable restoration capabilities of Diff-PLC across various reference-free and reference-based metrics, including PLCMOS, PESQ, STOI, and NISQA.-
dc.format.extent7-
dc.language영어-
dc.language.isoENG-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleDiff-PLC: A Diffusion-Based Approach For Effective Packet Loss Concealment-
dc.typeArticle-
dc.identifier.doi10.1109/SLT61566.2024.10832225-
dc.identifier.scopusid2-s2.0-85217377790-
dc.identifier.wosid001440556800048-
dc.identifier.bibliographicCitationProceedings of 2024 IEEE Spoken Language Technology Workshop, SLT 2024, pp 357 - 363-
dc.citation.titleProceedings of 2024 IEEE Spoken Language Technology Workshop, SLT 2024-
dc.citation.startPage357-
dc.citation.endPage363-
dc.type.docTypeProceedings Paper-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaAcoustics-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaLinguistics-
dc.relation.journalWebOfScienceCategoryAcoustics-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryComputer Science, Interdisciplinary Applications-
dc.relation.journalWebOfScienceCategoryLinguistics-
dc.subject.keywordAuthorDiffusion probabilistic model-
dc.subject.keywordAuthorFiLM conditioning-
dc.subject.keywordAuthorpacket loss concealment-
Files in This Item
There are no files associated with this item.
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE