Cited 0 time in
Towards Robust Packet Loss Concealment System With ASR-Guided Representations
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | 양다희 | - |
| dc.contributor.author | Chang, Joon-Hyuk | - |
| dc.date.accessioned | 2024-11-28T13:00:41Z | - |
| dc.date.available | 2024-11-28T13:00:41Z | - |
| dc.date.issued | 2023-12 | - |
| dc.identifier.issn | 0000-0000 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/196342 | - |
| dc.description.abstract | Despite the significant advancements and promising performance of deep learning-based packet loss concealment (PLC) systems in transmission systems, their focus on modeling acoustic features for reconstructing lost packets is insufficient to achieve smooth transitions during speech reconstruction. Therefore, to address this limitation, we propose integrating linguistic information derived from a speech recognition system as auxiliary features in the PLC system. By extracting ASR-guided representations and incorporating them using auxiliary loss, we successfully demonstrate a substantial improvement in the perceptual quality and intelligibility of the reconstructed speech. Our evaluation conducted on the wall street journal dataset further validates the effectiveness of our approach through experiments involving different packet loss rates and performance metrics. | - |
| dc.format.extent | 8 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
| dc.title | Towards Robust Packet Loss Concealment System With ASR-Guided Representations | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.1109/ASRU57964.2023.10389616 | - |
| dc.identifier.scopusid | 2-s2.0-85184668562 | - |
| dc.identifier.bibliographicCitation | 2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023, pp 1 - 8 | - |
| dc.citation.title | 2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023 | - |
| dc.citation.startPage | 1 | - |
| dc.citation.endPage | 8 | - |
| dc.type.docType | Conference paper | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.subject.keywordPlus | Packet loss | - |
| dc.subject.keywordPlus | Speech communication | - |
| dc.subject.keywordPlus | Speech intelligibility | - |
| dc.subject.keywordPlus | Speech recognition | - |
| dc.subject.keywordPlus | Speech transmission | - |
| dc.subject.keywordAuthor | auxiliary feature | - |
| dc.subject.keywordAuthor | CTC-based ASR system | - |
| dc.subject.keywordAuthor | fine-tuning | - |
| dc.subject.keywordAuthor | Packet loss concealment | - |
| dc.identifier.url | https://ieeexplore.ieee.org/document/10389616 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
