Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

IMPROVING TARGET SOUND EXTRACTION WITH TIMESTAMP KNOWLEDGE DISTILLATION

Authors
Kim, DailBaek, Min-SangKim, YungyeoChang, Joon-Hyuk
Issue Date
Apr-2024
Publisher
Institute of Electrical and Electronics Engineers Inc.
Keywords
privileged knowledge distillation; Target sound extraction; timestamp information
Citation
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp 1396 - 1400
Pages
5
Indexed
SCOPUS
Journal Title
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Start Page
1396
End Page
1400
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/197476
DOI
10.1109/ICASSP48485.2024.10447525
ISSN
0736-7791
1520-6149
Abstract
In this paper, we propose a timestamp knowledge distillation (TKD) method that adopts privileged knowledge distillation to enhance the performance of deep neural network (DNN)-based target sound extraction (TSE). While previous studies have mainly used n-hot vectors to indicate the type of target sound events (SEs), which are termed weak labels (WLs), recent studies demonstrated that timestamp knowledge of SEs is meaningful information to improve the TSE performance. To utilize timestamp knowledge, we use the oracle strong labels (OSLs) that indicate the occurrence of target SEs in the audio clip as privileged information. However, the OSLs are difficult to gain in real-world applications compared to WLs. We thus propose the TKD that transfers the timestamp knowledge from the teacher model trained using both WLs and OSLs to the student model trained using only WLs via a loss function. Experimental results across multiple DNN architectures confirmed that the OSLs enhanced the TSE significantly. Moreover, the TKD notably improved the student model's performance compared to the baseline trained only with WLs.
Files in This Item
There are no files associated with this item.
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE