Progressive Subband Modeling for Artifacts-free Speech Super-resolution
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Kim, Donghyun | - |
| dc.contributor.author | Chang, Joon-Hyuk | - |
| dc.date.accessioned | 2025-07-24T07:30:22Z | - |
| dc.date.available | 2025-07-24T07:30:22Z | - |
| dc.date.issued | 2025-03 | - |
| dc.identifier.issn | 0736-7791 | - |
| dc.identifier.issn | 1520-6149 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208320 | - |
| dc.description.abstract | In this paper, we consider a new reconstruction loss together with a subband objective, in the form of an auxiliary loss function, for artifacts-free speech super-resolution. Unlike prior work, which mainly considers the full frequency band for speech super-resolution, the proposed method alleviates distortion generated during deep learning training via subband modeling. To further minimize spectral artifacts, we also apply progressive curriculum learning. Our experimental results demonstrate that the proposed method outperforms the evaluated baselines on both the TIMIT and VCTK datasets, with gains in both intelligibility and perceptual scores. Furthermore, a visual comparison of spectrograms verifies that the proposed method restores speech with fewer artifacts. Audio samples and the implementation are available online. | - |
| dc.format.extent | 5 | - |
| dc.language | English | - |
| dc.language.iso | ENG | - |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
| dc.title | Progressive Subband Modeling for Artifacts-free Speech Super-resolution | - |
| dc.type | Article | - |
| dc.publisher.location | United States | - |
| dc.identifier.doi | 10.1109/ICASSP49660.2025.10889911 | - |
| dc.identifier.scopusid | 2-s2.0-105009589436 | - |
| dc.identifier.wosid | 001611517600685 | - |
| dc.identifier.bibliographicCitation | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp 1 - 5 | - |
| dc.citation.title | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings | - |
| dc.citation.startPage | 1 | - |
| dc.citation.endPage | 5 | - |
| dc.type.docType | Conference paper | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Acoustics; Computer Science; Engineering | - |
| dc.relation.journalWebOfScienceCategory | Acoustics; Computer Science, Artificial Intelligence; Engineering, Electrical & Electronic | - |
| dc.subject.keywordPlus | Bandwidth | - |
| dc.subject.keywordPlus | Computer vision | - |
| dc.subject.keywordPlus | Gears | - |
| dc.subject.keywordPlus | Intelligent systems | - |
| dc.subject.keywordPlus | Speech communication | - |
| dc.subject.keywordAuthor | bandwidth extension | - |
| dc.subject.keywordAuthor | curriculum learning | - |
| dc.subject.keywordAuthor | spectral artifacts | - |
| dc.subject.keywordAuthor | speech super-resolution | - |
| dc.identifier.url | https://ieeexplore.ieee.org/document/10889911 | - |
