Cited 0 time in
Acoustic-Scene-Aware Target Sound Separation With Sound Embedding Refinement
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Kim, Yungyeo | - |
| dc.contributor.author | Chang, Joon-hyuk | - |
| dc.date.accessioned | 2025-12-11T01:00:15Z | - |
| dc.date.available | 2025-12-11T01:00:15Z | - |
| dc.date.issued | 2024-05 | - |
| dc.identifier.issn | 2169-3536 | - |
| dc.identifier.issn | 2169-3536 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/209718 | - |
| dc.description.abstract | Target sound separation (TSS) aims to separate specific sounds of interest, like a speech or a musical instrument, from complex acoustic environments with multiple overlapping sounds. In realistic scenarios, the important sounds that we want to hear can differ depending on transitions in the surrounding acoustic scene. This study addresses the problem of acoustic-scene-aware TSS, which separates predefined sets of target sounds considered significant for the current acoustic environment. Predefined sets of target sounds were determined beforehand based on the expected acoustic scenes. For example, the sound of a bicycle bell is predefined as the target sound in a park scene and separated from a mixture of various sounds. As a solution, we propose a novel approach called Acoustic-SCene-Aware Target sound separation with sound Embedding Refinement (SCATER). It refines pre-trained sound embeddings into acoustic-scene-aware representations to guide the separation of specific target sounds based on the surrounding scene. SCATER adopts a multiple instance learning-based acoustic scene classification system for rapid response to scene changes. The refined sound embeddings serve as cues for the TSS model, enabling the separation of different target sounds across various acoustic scenes. Experimental results demonstrate the superiority of SCATER over an approach that combines sound separation and scene classification separately. | - |
| dc.format.extent | 11 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
| dc.title | Acoustic-Scene-Aware Target Sound Separation With Sound Embedding Refinement | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.1109/ACCESS.2024.3402736 | - |
| dc.identifier.scopusid | 2-s2.0-85194062178 | - |
| dc.identifier.wosid | 001231354300001 | - |
| dc.identifier.bibliographicCitation | IEEE Access, v.12, pp 71606 - 71616 | - |
| dc.citation.title | IEEE Access | - |
| dc.citation.volume | 12 | - |
| dc.citation.startPage | 71606 | - |
| dc.citation.endPage | 71616 | - |
| dc.type.docType | Article in press | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalResearchArea | Telecommunications | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
| dc.relation.journalWebOfScienceCategory | Telecommunications | - |
| dc.subject.keywordPlus | SPEAKER EXTRACTION | - |
| dc.subject.keywordAuthor | Acoustic scene classification | - |
| dc.subject.keywordAuthor | embedding refinement | - |
| dc.subject.keywordAuthor | scene awareness | - |
| dc.subject.keywordAuthor | sound embedding | - |
| dc.subject.keywordAuthor | deep neural network | - |
| dc.subject.keywordAuthor | target sound separation | - |
| dc.identifier.url | https://ieeexplore.ieee.org/document/10534351 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
