Quad-Net: Melspectrogram Vocoder with Convolutional Layers Restricted by the Quadrature Mirror Filter for Perfect Reconstruction
- Authors
- Song, Nam-Seok; Chang, Joon-Hyuk
- Issue Date
- Mar-2025
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Keywords
- perfect reconstruction; quadrature mirror filter; singal processing; vocoder
- Citation
- ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp 1 - 5
- Pages
- 5
- Indexed
- SCOPUS
- Journal Title
- ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
- Start Page
- 1
- End Page
- 5
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208313
- DOI
- 10.1109/ICASSP49660.2025.10890659
- ISSN
- 0736-7791
1520-6149
- Abstract
- Recently, neural vocoders have applied signal processing methods to synthesize speech to reduce computational complexity. However, most methods lack the benefits of a data-driven approach and the flexibility of hyper-parameters, such as filter length, because they rely on fixed signal processing filters. In this paper, we introduce Quad-Net, a network that includes restricted convolutional layers shaped by quadrature mirror synthesis filter banks. It is optimized with a perfect reconstruction loss derived from perfect reconstruction filter banks. This enables us to control filter lengths and degrees of data-drivenness. The results show that the filter parameters trained in our model exhibit characteristics similar to those of other signal processing methods with lower parameters. Furthermore, by increasing the filter length of Quad-Net, we can obtain filters that have complex frequency responses.It shows that a new approach enables the design of more complex filters that are adaptive to neural networks, diverging from previous methods.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.