IMPROVING TARGET SOUND EXTRACTION WITH TIMESTAMP KNOWLEDGE DISTILLATION
- Authors
- Kim, Dail; Baek, Min-Sang; Kim, Yungyeo; Chang, Joon-Hyuk
- Issue Date
- Apr-2024
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Keywords
- privileged knowledge distillation; target sound extraction; timestamp information
- Citation
- ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp 1396 - 1400
- Pages
- 5
- Indexed
- SCOPUS
- Journal Title
- ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
- Start Page
- 1396
- End Page
- 1400
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/197476
- DOI
- 10.1109/ICASSP48485.2024.10447525
- ISSN
- 0736-7791
- 1520-6149
- Abstract
- In this paper, we propose a timestamp knowledge distillation (TKD) method that adopts privileged knowledge distillation to enhance the performance of deep neural network (DNN)-based target sound extraction (TSE). Previous studies have mainly used n-hot vectors, termed weak labels (WLs), to indicate the types of target sound events (SEs), whereas recent studies have demonstrated that timestamp knowledge of SEs is meaningful information for improving TSE performance. To utilize timestamp knowledge, we use oracle strong labels (OSLs), which indicate the occurrence of target SEs in the audio clip, as privileged information. However, OSLs are more difficult to obtain than WLs in real-world applications. We therefore propose TKD, which transfers timestamp knowledge from a teacher model trained using both WLs and OSLs to a student model trained using only WLs via a loss function. Experimental results across multiple DNN architectures confirmed that the OSLs significantly enhanced TSE performance. Moreover, TKD notably improved the student model's performance compared to the baseline trained only with WLs.
- Appears in Collections
- College of Engineering (Seoul) > Department of Electronic Engineering (Seoul) > 1. Journal Articles
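
The abstract describes a teacher that sees both WLs and OSLs and a student that sees only WLs, with knowledge transferred "via a loss function". The record does not specify that loss, so the following is only a minimal sketch under stated assumptions: the task loss is taken to be a mean absolute error on the waveform, the distillation term a mean squared error between student and teacher outputs, and `alpha` a hypothetical mixing weight. All function names and the toy arrays are illustrative and are not taken from the paper.

```python
import numpy as np

# Toy oracle strong label (OSL): rows are sound-event classes, columns are
# time frames; 1 marks a frame in which that class is active.
oracle_strong_label = np.array([
    [0, 1, 1, 0],
    [0, 0, 0, 0],
    [1, 1, 0, 0],
])

# A weak label (WL) is the clip-level n-hot vector: a class is marked
# if it is active in any frame.
weak_label = oracle_strong_label.max(axis=1)


def extraction_loss(estimate, target):
    """Assumed task loss: mean absolute error against the reference signal."""
    return float(np.mean(np.abs(estimate - target)))


def distillation_loss(student_out, teacher_out):
    """Assumed distillation term: mean squared error to the teacher output."""
    return float(np.mean((student_out - teacher_out) ** 2))


def tkd_student_loss(student_out, teacher_out, target, alpha=0.5):
    """Hypothetical student objective mixing the task and distillation terms.

    The teacher output is assumed to come from a frozen model conditioned on
    both WLs and OSLs; the student is conditioned on WLs only.
    """
    task = extraction_loss(student_out, target)
    distill = distillation_loss(student_out, teacher_out)
    return (1.0 - alpha) * task + alpha * distill


# Toy signals standing in for model outputs and the reference target sound.
target = np.array([0.0, 1.0, 1.0, 0.0])
teacher_out = np.array([0.0, 0.9, 1.0, 0.0])
student_out = np.array([0.2, 0.6, 0.8, 0.1])

loss = tkd_student_loss(student_out, teacher_out, target, alpha=0.5)
```

In this sketch the student never receives `oracle_strong_label` directly; timestamp information reaches it only through `teacher_out`, which is the sense in which the OSLs act as privileged information.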
