Cited 0 time in
Diff-PLC: A Diffusion-Based Approach For Effective Packet Loss Concealment
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Yang, Da-Hee | - |
| dc.contributor.author | Chang, Joon-Hyuk | - |
| dc.date.accessioned | 2025-03-11T02:00:14Z | - |
| dc.date.available | 2025-03-11T02:00:14Z | - |
| dc.date.issued | 2025-01 | - |
| dc.identifier.issn | 2639-5479 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/206732 | - |
| dc.description.abstract | We introduce diffusion-based packet loss concealment (DiffPLC), a novel approach designed to improve speech quality in the presence of packet losses for speech transmission. Derived from the foundation of a diffusion-based neural vocoder, the Diff-PLC introduces a crucial modification and supplementary concepts for the reconstruction of lost packets. A key aspect of the Diff-PLC involves integrating a feature-wise linear modulation layer into the diffusion model, facilitating the seamless incorporation of a conditioning feature. Furthermore, the Diff-PLC leverages packet loss embedding as an additional conditioning feature which significantly assists the diffusion model in restoring lost packets. The proposed model is evaluated using the blind test set of the INTERSPEECH 2022 PLC challenge, demonstrating the considerable restoration capabilities of Diff-PLC across various reference-free and reference-based metrics, including PLCMOS, PESQ, STOI, and NISQA. | - |
| dc.format.extent | 7 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
| dc.title | Diff-PLC: A Diffusion-Based Approach For Effective Packet Loss Concealment | - |
| dc.type | Article | - |
| dc.identifier.doi | 10.1109/SLT61566.2024.10832225 | - |
| dc.identifier.scopusid | 2-s2.0-85217377790 | - |
| dc.identifier.wosid | 001440556800048 | - |
| dc.identifier.bibliographicCitation | Proceedings of 2024 IEEE Spoken Language Technology Workshop, SLT 2024, pp 357 - 363 | - |
| dc.citation.title | Proceedings of 2024 IEEE Spoken Language Technology Workshop, SLT 2024 | - |
| dc.citation.startPage | 357 | - |
| dc.citation.endPage | 363 | - |
| dc.type.docType | Proceedings Paper | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Acoustics | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalResearchArea | Linguistics | - |
| dc.relation.journalWebOfScienceCategory | Acoustics | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Interdisciplinary Applications | - |
| dc.relation.journalWebOfScienceCategory | Linguistics | - |
| dc.subject.keywordAuthor | Diffusion probabilistic model | - |
| dc.subject.keywordAuthor | FiLM conditioning | - |
| dc.subject.keywordAuthor | packet loss concealment | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
