Detailed Information

Spatially Weighted Contrastive Learning for Robust Sound Source Localization

Full metadata record
DC Field | Value | Language
dc.contributor.author | Kim, Hyun-Soo | -
dc.contributor.author | Yang, Da-Hee | -
dc.contributor.author | Chang, Joon-Hyuk | -
dc.date.accessioned | 2025-11-20T01:30:30Z | -
dc.date.available | 2025-11-20T01:30:30Z | -
dc.date.issued | 2025-08 | -
dc.identifier.issn | 2958-1796 | -
dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/209223 | -
dc.description.abstract | We propose a spatially weighted contrastive loss (SWeC loss) for sound source localization in real-world scenarios using multi-channel speech data. In multi-channel localization, phase differences between microphone channels provide critical cues for estimating the azimuth angle of incoming speech. To effectively extract azimuth information, we leverage contrastive learning and introduce a novel loss function that incorporates spatial relationships between azimuth classes. Specifically, our loss assigns weights to negative pairs based on their angular distance, penalizing high similarity between embeddings corresponding to distant angles. Furthermore, we propose a contrastive data generation method tailored to multi-channel localization, enhancing the effectiveness of contrastive learning. Experimental results demonstrate that the proposed loss function and data generation strategy significantly improve localization performance. | -
dc.format.extent | 5 | -
dc.language | English | -
dc.language.iso | ENG | -
dc.publisher | International Speech Communication Association | -
dc.title | Spatially Weighted Contrastive Learning for Robust Sound Source Localization | -
dc.type | Article | -
dc.identifier.doi | 10.21437/Interspeech.2025-2666 | -
dc.identifier.scopusid | 2-s2.0-105020083243 | -
dc.identifier.bibliographicCitation | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp 2490 - 2494 | -
dc.citation.title | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH | -
dc.citation.startPage | 2490 | -
dc.citation.endPage | 2494 | -
dc.type.docType | Conference paper | -
dc.description.isOpenAccess | N | -
dc.description.journalRegisteredClass | scopus | -
dc.subject.keywordPlus | Contrastive Learning | -
dc.subject.keywordPlus | Microphones | -
dc.subject.keywordPlus | Speech communication | -
dc.subject.keywordAuthor | contrastive learning | -
dc.subject.keywordAuthor | sound source localization | -
dc.identifier.url | https://www.isca-archive.org/interspeech_2025/kim25v_interspeech.html | -
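
Illustrative sketch (not taken from the paper): the abstract states that negative pairs are weighted by their angular distance, so that high similarity between embeddings of distant azimuths is penalized more strongly. The record does not give the actual SWeC formula, so everything below is an assumption made for illustration: the InfoNCE-style form of the loss, the linear weight 1 + alpha * d / 180, the temperature, and the hypothetical names weighted_contrastive_loss and circular_distance_deg.

```python
import numpy as np


def circular_distance_deg(a, b):
    """Smallest absolute difference between two azimuths, in degrees (0 to 180)."""
    diff = np.abs(np.asarray(a, dtype=float) - np.asarray(b, dtype=float)) % 360.0
    return np.minimum(diff, 360.0 - diff)


def weighted_contrastive_loss(embeddings, azimuths_deg, temperature=0.1, alpha=1.0):
    """Contrastive loss whose negative terms grow with angular distance.

    ASSUMPTION: this is a generic InfoNCE-style loss with a linear distance
    weight, not the formula from the paper.
    """
    z = np.asarray(embeddings, dtype=float)
    az = np.asarray(azimuths_deg, dtype=float)

    # Cosine similarity between all pairs, scaled by the temperature.
    z = z / np.linalg.norm(z, axis=1, keepdims=True)
    sim = z @ z.T / temperature

    # Pairwise circular distance between azimuth labels.
    dist = circular_distance_deg(az[:, None], az[None, :])

    not_self = ~np.eye(len(az), dtype=bool)
    pos = (dist == 0) & not_self   # same azimuth class, excluding the anchor itself
    neg = dist > 0                 # different azimuth class

    # ASSUMPTION: weight rises linearly from 1 (adjacent) to 1 + alpha (opposite).
    weights = 1.0 + alpha * dist / 180.0

    # Subtracting the row maximum leaves each ratio unchanged but avoids overflow.
    exp_sim = np.exp(sim - sim.max(axis=1, keepdims=True))
    pos_sum = (exp_sim * pos).sum(axis=1)
    neg_sum = (exp_sim * weights * neg).sum(axis=1)

    valid = pos.any(axis=1)        # anchors that have at least one positive
    if not valid.any():
        raise ValueError("batch contains no positive pairs")
    return float(np.mean(-np.log(pos_sum[valid] / (pos_sum[valid] + neg_sum[valid]))))


if __name__ == "__main__":
    emb = np.array([[1.0, 0.0], [0.9, 0.1], [0.8, 0.6]])
    near = weighted_contrastive_loss(emb, [0, 0, 10])
    far = weighted_contrastive_loss(emb, [0, 0, 180])
    print(near, far)  # the same embeddings cost more when the negative is far away
```

With alpha set to 0 the weights are all 1 and the sketch reduces to an unweighted contrastive loss, which makes the effect of the spatial weighting easy to isolate.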
Appears in Collections
College of Engineering (Seoul) > School of Electronic Engineering (Seoul) > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.