IMPROVING TARGET SOUND EXTRACTION WITH TIMESTAMP KNOWLEDGE DISTILLATION

Kim, Dail; Baek, Min-Sang; Kim, Yungyeo; Chang, Joon-Hyuk

doi:10.1109/ICASSP48485.2024.10447525

Full metadata record

DC Field	Value	Language
dc.contributor.author	Kim, Dail	-
dc.contributor.author	Baek, Min-Sang	-
dc.contributor.author	Kim, Yungyeo	-
dc.contributor.author	Chang, Joon-Hyuk	-
dc.date.accessioned	2024-11-28T16:01:46Z	-
dc.date.available	2024-11-28T16:01:46Z	-
dc.date.issued	2024-04	-
dc.identifier.issn	0736-7791	-
dc.identifier.issn	1520-6149	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/197476	-
dc.description.abstract	In this paper, we propose a timestamp knowledge distillation (TKD) method that adopts privileged knowledge distillation to enhance the performance of deep neural network (DNN)-based target sound extraction (TSE). While previous studies have mainly used n-hot vectors, termed weak labels (WLs), to indicate the type of target sound events (SEs), recent studies demonstrated that timestamp knowledge of SEs is meaningful information for improving TSE performance. To utilize timestamp knowledge, we use oracle strong labels (OSLs), which indicate the occurrence of target SEs in the audio clip, as privileged information. However, OSLs are harder to obtain than WLs in real-world applications. We thus propose TKD, which transfers the timestamp knowledge from a teacher model trained using both WLs and OSLs to a student model trained using only WLs via a loss function. Experimental results across multiple DNN architectures confirmed that the OSLs enhanced TSE performance significantly. Moreover, TKD notably improved the student model's performance compared to the baseline trained only with WLs.	-
dc.format.extent	5	-
dc.language	English	-
dc.language.iso	ENG	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.title	IMPROVING TARGET SOUND EXTRACTION WITH TIMESTAMP KNOWLEDGE DISTILLATION	-
dc.type	Article	-
dc.publisher.location	United States	-
dc.identifier.doi	10.1109/ICASSP48485.2024.10447525	-
dc.identifier.scopusid	2-s2.0-85195372806	-
dc.identifier.wosid	001285850001142	-
dc.identifier.bibliographicCitation	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp 1396 - 1400	-
dc.citation.title	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings	-
dc.citation.startPage	1396	-
dc.citation.endPage	1400	-
dc.type.docType	Proceedings Paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Acoustics	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalResearchArea	Imaging Science & Photographic Technology	-
dc.relation.journalWebOfScienceCategory	Acoustics	-
dc.relation.journalWebOfScienceCategory	Computer Science, Artificial Intelligence	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.relation.journalWebOfScienceCategory	Imaging Science & Photographic Technology	-
dc.subject.keywordAuthor	privileged knowledge distillation	-
dc.subject.keywordAuthor	Target sound extraction	-
dc.subject.keywordAuthor	timestamp information	-
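
The abstract contrasts weak labels (n-hot class vectors) with oracle strong labels (timestamps of target events) and describes transferring timestamp knowledge from teacher to student "via a loss function". The record does not give the form of that loss, so the following is only a minimal sketch under stated assumptions: the function names, the frame-overlap rule, the mean-squared-error distillation term, and the weight `alpha` are all hypothetical and are not taken from the paper.

```python
from typing import List, Sequence, Tuple


def weak_label(num_classes: int, target_classes: Sequence[int]) -> List[int]:
    """n-hot vector: 1 for each target sound-event class, 0 otherwise."""
    targets = set(target_classes)
    return [1 if c in targets else 0 for c in range(num_classes)]


def strong_label(num_frames: int, frame_sec: float,
                 events: Sequence[Tuple[float, float]]) -> List[int]:
    """Frame-level activity: 1 where any (onset, offset) event overlaps the frame.

    Assumption: frame t spans [t * frame_sec, (t + 1) * frame_sec).
    """
    label = []
    for t in range(num_frames):
        start, end = t * frame_sec, (t + 1) * frame_sec
        active = any(start < offset and end > onset for onset, offset in events)
        label.append(1 if active else 0)
    return label


def mse(a: Sequence[float], b: Sequence[float]) -> float:
    """Mean squared error between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def student_loss(extraction_loss: float,
                 student_feat: Sequence[float],
                 teacher_feat: Sequence[float],
                 alpha: float = 0.5) -> float:
    """Hypothetical student objective: extraction loss plus a distillation term.

    The teacher features are assumed to come from a model that saw both weak
    and strong labels; the student only ever receives the weak label.
    """
    return extraction_loss + alpha * mse(student_feat, teacher_feat)


print(weak_label(4, [1, 3]))                    # [0, 1, 0, 1]
print(strong_label(5, 1.0, [(1.0, 3.0)]))       # [0, 1, 1, 0, 0]
print(student_loss(2.0, [0.0, 0.0], [1.0, 1.0]))  # 2.5
```

In this sketch the strong label is consumed only on the teacher side, which mirrors the privileged-information setting the abstract describes: at inference time the student needs nothing beyond the weak label.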

Appears in Collections: Seoul College of Engineering > Seoul School of Electronic Engineering > 1. Journal Articles
