Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Deeply supervised curriculum learning for deep neural network-based sound source localization

Full metadata record
DC Field Value Language
dc.contributor.authorBaek, Min-Sang-
dc.contributor.authorYang, Joon-Young-
dc.contributor.authorChang, Joon-Hyuk-
dc.date.accessioned2023-10-10T02:36:55Z-
dc.date.available2023-10-10T02:36:55Z-
dc.date.created2023-10-04-
dc.date.issued2023-08-
dc.identifier.issn2308-457X-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/191802-
dc.description.abstractDeep neural network (DNN) has made impressive progress in sound source localization (SSL) tasks with the hard n-hot labels that represent specific directions-of-arrivals (DOAs). However, recent study suggested soft DOA labels, considering the correlations between targets and nearby DOAs. In this study, to effectively train a DNN using soft labels, we propose deeply supervised curriculum learning (DSCL) by adopting the two techniques for the DNN, deep supervision (DS) and curriculum learning (CL). We train a DNN to solve SSL problems progressing from easier to harder, expecting the DNN would gradually reduce the angular region of the target DOAs. It is gained by various resolution soft targets for the different DNN layers to deeply supervise the DNN, while increasing the angular selectivity of the targets from the early to late stages of training by CL. Proposed method was verified on datasets with multi-speakers, and exceeded the hard-label methods with great improvements.-
dc.language영어-
dc.language.isoen-
dc.publisherInternational Speech Communication Association-
dc.titleDeeply supervised curriculum learning for deep neural network-based sound source localization-
dc.typeArticle-
dc.contributor.affiliatedAuthorChang, Joon-Hyuk-
dc.identifier.doi10.21437/Interspeech.2023-2451-
dc.identifier.scopusid2-s2.0-85171561839-
dc.identifier.bibliographicCitationProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v.2023-August, pp.3744 - 3748-
dc.relation.isPartOfProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH-
dc.citation.titleProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH-
dc.citation.volume2023-August-
dc.citation.startPage3744-
dc.citation.endPage3748-
dc.type.rimsART-
dc.type.docTypeConference paper-
dc.description.journalClass1-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.subject.keywordPlusAcoustic generators-
dc.subject.keywordPlusDeep neural networks-
dc.subject.keywordPlusDirection of arrival-
dc.subject.keywordPlusSpeech communication-
dc.subject.keywordPlusAngular regions-
dc.subject.keywordPlusCurriculum learning-
dc.subject.keywordPlusDeep supervision-
dc.subject.keywordPlusDirectionof-arrival (DOA)-
dc.subject.keywordPlusLocalization problems-
dc.subject.keywordPlusNetwork-based-
dc.subject.keywordPlusSoft labels-
dc.subject.keywordPlusSoft targets-
dc.subject.keywordPlusSound source localization-
dc.subject.keywordPlusTarget direction-
dc.subject.keywordPlusCurricula-
dc.subject.keywordAuthorcurriculum learning-
dc.subject.keywordAuthordeep neural network-
dc.subject.keywordAuthordeep supervision-
dc.subject.keywordAuthordirection-of-arrival-
dc.subject.keywordAuthorsound source localization-
dc.identifier.urlhttps://www.isca-speech.org/archive/interspeech_2023/baek23_interspeech.html-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE