Progressive Subband Modeling for Artifacts-free Speech Super-resolution

Kim, Donghyun; Chang, Joon-Hyuk

doi:10.1109/ICASSP49660.2025.10889911

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Progressive Subband Modeling for Artifacts-free Speech Super-resolution

Full metadata record

DC Field	Value	Language
dc.contributor.author	Kim, Donghyun	-
dc.contributor.author	Chang, Joon-Hyuk	-
dc.date.accessioned	2025-07-24T07:30:22Z	-
dc.date.available	2025-07-24T07:30:22Z	-
dc.date.issued	2025-03	-
dc.identifier.issn	0736-7791	-
dc.identifier.issn	1520-6149	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208320	-
dc.description.abstract	In this paper, we consider new reconstruction loss together with a subband objective in the form of auxiliary loss function for artifacts-free speech super-resolution. Unlike prior work which mainly consider full band of frequency region for speech super-resolution, the proposed method alleviates distortion generated during deep learning training via subband modeling. To further minimize spectral artifacts, we also apply progressive curriculum learning for superior performance. Our experimental results demonstrate that the proposed method outperforms the evaluated baselines on the both TIMIT and VCTK dataset by increase in both intelligibility and perceptual score. Furthermore, the visual representation of spectrograms comparison verify that our proposed method clearly restoring speech with fewer artifacts. Audio samples and the implementations are available online.	-
dc.format.extent	5	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.title	Progressive Subband Modeling for Artifacts-free Speech Super-resolution	-
dc.type	Article	-
dc.publisher.location	미국	-
dc.identifier.doi	10.1109/ICASSP49660.2025.10889911	-
dc.identifier.scopusid	2-s2.0-105009589436	-
dc.identifier.wosid	001611517600685	-
dc.identifier.bibliographicCitation	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp 1 - 5	-
dc.citation.title	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings	-
dc.citation.startPage	1	-
dc.citation.endPage	5	-
dc.type.docType	Conference paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	AcousticsComputer ScienceEngineering	-
dc.relation.journalWebOfScienceCategory	AcousticsComputer Science, Artificial IntelligenceEngineering, Electrical & Electronic	-
dc.subject.keywordPlus	Bandwidth	-
dc.subject.keywordPlus	Computer vision	-
dc.subject.keywordPlus	Gears	-
dc.subject.keywordPlus	Intelligent systems	-
dc.subject.keywordPlus	Speech communication	-
dc.subject.keywordAuthor	bandwidth extension	-
dc.subject.keywordAuthor	curriculum learning	-
dc.subject.keywordAuthor	spectral artifacts	-
dc.subject.keywordAuthor	speech super-resolution	-
dc.identifier.url	https://ieeexplore.ieee.org/document/10889911	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk: COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE