Acoustic-Scene-Aware Target Sound Separation With Sound Embedding Refinement

Kim, Yungyeo; Chang, Joon-hyuk

doi:10.1109/ACCESS.2024.3402736

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Acoustic-Scene-Aware Target Sound Separation With Sound Embedding Refinement

Full metadata record

DC Field	Value	Language
dc.contributor.author	Kim, Yungyeo	-
dc.contributor.author	Chang, Joon-hyuk	-
dc.date.accessioned	2025-12-11T01:00:15Z	-
dc.date.available	2025-12-11T01:00:15Z	-
dc.date.issued	2024-05	-
dc.identifier.issn	2169-3536	-
dc.identifier.issn	2169-3536	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/209718	-
dc.description.abstract	Target sound separation (TSS) aims to separate specific sounds of interest, like a speech or a musical instrument, from complex acoustic environments with multiple overlapping sounds. In realistic scenarios, the important sounds that we want to hear can differ depending on transitions in the surrounding acoustic scene. This study addresses the problem of acoustic-scene-aware TSS, which separates predefined sets of target sounds considered significant for the current acoustic environment. Predefined sets of target sounds were determined beforehand based on the expected acoustic scenes. For example, the sound of a bicycle bell is predefined as the target sound in a park scene and separated from a mixture of various sounds. As a solution, we propose a novel approach called Acoustic-SCene-Aware Target sound separation with sound Embedding Refinement (SCATER). It refines pre-trained sound embeddings into acoustic-scene-aware representations to guide the separation of specific target sounds based on the surrounding scene. SCATER adopts a multiple instance learning-based acoustic scene classification system for rapid response to scene changes. The refined sound embeddings serve as cues for the TSS model, enabling the separation of different target sounds across various acoustic scenes. Experimental results demonstrate the superiority of SCATER over an approach that combines sound separation and scene classification separately.	-
dc.format.extent	11	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.title	Acoustic-Scene-Aware Target Sound Separation With Sound Embedding Refinement	-
dc.type	Article	-
dc.publisher.location	미국	-
dc.identifier.doi	10.1109/ACCESS.2024.3402736	-
dc.identifier.scopusid	2-s2.0-85194062178	-
dc.identifier.wosid	001231354300001	-
dc.identifier.bibliographicCitation	IEEE Access, v.12, pp 71606 - 71616	-
dc.citation.title	IEEE Access	-
dc.citation.volume	12	-
dc.citation.startPage	71606	-
dc.citation.endPage	71616	-
dc.type.docType	Article in press	-
dc.description.isOpenAccess	Y	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalResearchArea	Telecommunications	-
dc.relation.journalWebOfScienceCategory	Computer Science, Information Systems	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.relation.journalWebOfScienceCategory	Telecommunications	-
dc.subject.keywordPlus	SPEAKER EXTRACTION	-
dc.subject.keywordAuthor	Acoustic scene classification	-
dc.subject.keywordAuthor	embedding refinement	-
dc.subject.keywordAuthor	scene awareness	-
dc.subject.keywordAuthor	sound embedding	-
dc.subject.keywordAuthor	deep neural network	-
dc.subject.keywordAuthor	target sound separation	-
dc.identifier.url	https://ieeexplore.ieee.org/document/10534351	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk: COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE